[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: How to Reset ipropd



Well, I guess the log file doesn't always get updated on the slave  
like you'd think, so things may be working better than I thought.

> -rw-------    1 root     other    18767872 2006-03-16 00:31 heimdal.db
> -rw-------    1 root     other          24 2006-03-16 00:30  
> heimdal.log

In the above case the slave log file hasn't been updated, and still  
shows version 2.  The master is at version 3, which includes an extra  
password change.  However kadmin -l get on the slave shows the  
password change *has* been propagated.

Reading the ipropd code, the iprop protocol seems simple enough.  I  
do *not* understand the log file processing though.  In particular I  
do not understand why slaves grow their log files beyond the 24 bytes  
needed to log a version number.

If you dump the DB, delete the log file, and reload the DB that seems  
to create a new 24-byte log file on the master.  However one time I  
did that .../heimdal/sbin/dump_log could not dump the new log file.   
In that case a slave would download a fresh copy of the DB, and  
create its own 24-byte log file, but it did not download any further  
updates.  dump_log likewise could not dump the log file on the  
slave.  Using od I saw nothing significantly different about the non- 
working log file.  (One word was different in a way consistent with a  
different time stamp, but I think the rest of the file appeared  
identical to a working one.  If there's interest I'll see if I can  
find a copy of it.)

On Mar 7, 2006, at 5:58 PM, Henry B. Hotz wrote:

>
> On Mar 7, 2006, at 2:24 PM, Björn Sandell wrote:
>
>> On Tue, 7 Mar 2006 09:07:31 -0800
>> "Henry B. Hotz" <hotz@jpl.nasa.gov> wrote:
>>
>>> I'm seeing a lot more flakiness in iprop for 0.7.2 than I saw in
>>> 0.6.3.  It's possible that I'm causing some of the problems myself,
>>> but I'd like to compare notes.
>>>
>>> Specifically I'm seeing situations where iprop is running, but the
>>> slave db is "frozen" and doesn't update.
>>
>> We're running 0.6.3 and I have never seen that particular failure  
>> mode.
>> It happens that a slave crashes or just hangs though. In our test
>> setup were presently running 0.7.1 master and 0.7.2 client and iprop
>> seem much more stable there (there's a lot less load on the test  
>> setup
>> though :). All this on redhat AS 3.1 and heimdal was configured with
>> '--disable-berkeley-db'.
>
> In production there's a firewall that gets in the way sometimes.   
> I've wound up with multi-gigabyte log files due to some third-order  
> fallout from that.  I don't blame iprod for all of those probelms.
>
> That doesn't apply in my test environment though.  I'm doing a  
> scripted upgrade from 0.6.3 which deletes the log files on both  
> master and slave(s).  Everything seems stable for a while.  Then  
> something happens (sometimes my fault) and it just can't/won't resync.
>
> Anyone have an opinion on deleting the "generation" field as part  
> of the dump/reload?
>
> ---------------------------------------------------------------------- 
> ------
> The opinions expressed in this message are mine,
> not those of Caltech, JPL, NASA, or the US Government.
> Henry.B.Hotz@jpl.nasa.gov, or hbhotz@oxy.edu
>
>
>