Amanda-Users

Re: Attempted upgrade to Amanda 2.5.0: hangs in amcheck

2008-04-15 18:20:58
Subject: Re: Attempted upgrade to Amanda 2.5.0: hangs in amcheck
From: FL <lengyel AT gmail DOT com>
To: "Dustin J. Mitchell" <dustin AT zmanda DOT com>
Date: Tue, 15 Apr 2008 18:15:27 -0400
On Tue, Apr 15, 2008 at 5:55 PM, Dustin J. Mitchell <dustin AT zmanda DOT com> 
wrote:
> On Tue, Apr 15, 2008 at 5:34 PM, FL <lengyel AT gmail DOT com> wrote:
> >  Maybe it's time to contribute a rewrite
> >  to chg-zd-mtx--unless there is an explicit inventory command I could
> >  invoke in the changer.conf script)
>
> That would be great!  That script doesn't get a lot of attention.

Ok that's good to know. If it turns out that an explicit inventory
is needed, say, after an unload, I'll add it.
>
> >  Now with the status correct, I have this:
> >  -sh-3.1$ amcheck Daily
> >  Amanda Tape Server Host Check
> >  -----------------------------
> >  Holding disk /home/amanda/holding-disk: 104765 MB disk space
> >  available, using 104665 MB
> >
> >  which hangs indefinitely
>
> Hmm.  Can you use 'strace' to see what it's up to?  Is the changer
> doing anything interesting?
>
> Dustin
>
> --
> Storage Software Engineer
> http://www.zmanda.com
>

So far chg-zd-mtx is behaving, and amtape is also behaving:

-sh-3.1$ /usr/lib/amanda/chg-zd-mtx -slot next
2 /dev/nst0
-sh-3.1$ /usr/lib/amanda/chg-zd-mtx -slot current
2 /dev/nst0
-sh-3.1$ amtape Daily reset
changer: got exit: 0 str: 1 /dev/nst0
amtape: changer is reset, slot 1 is loaded.
-sh-3.1$ amtape Daily slot next
changer: got exit: 0 str: 2 /dev/nst0
amtape: changed to slot 2 on /dev/nst0
-sh-3.1$


Now for the strace of amcheck Daily:

it starts promisingly enough until it hangs on the

rt_sigaction(SIGPIPE, {SIG_IGN}, NULL, 8) = 0
open("/var/log/amanda/amcheck.temp.5536",
O_RDWR|O_CREAT|O_TRUNC|O_LARGEFILE, 0600) = 4
unlink("/var/log/amanda/amcheck.temp.5536") = 0
clone(Amanda Tape Server Host Check
-----------------------------
Holding disk /home/amanda/holding-disk: 104765 MB disk space
available, using 104665 MB
child_stack=0, flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD,
child_tidptr=0xb7efabc8) = 5537
clone(child_stack=0,
flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD,
child_tidptr=0xb7efabc8) = 5539
wait4(-1,

... the wait completes and then

wait4(-1, [{WIFEXITED(s) && WEXITSTATUS(s) == 1}], 0, NULL) = 5539
--- SIGCHLD (Child exited) @ 0 (0) ---
wait4(-1,

...