Bacula-users

Re: [Bacula-users] bacula hang waiting for storage

2008-12-02 09:21:23
Subject: Re: [Bacula-users] bacula hang waiting for storage
From: Pasi Kärkkäinen <pasik AT iki DOT fi>
To: Arno Lehmann <al AT its-lehmann DOT de>
Date: Tue, 2 Dec 2008 16:17:26 +0200
On Thu, Nov 27, 2008 at 05:53:41PM +0100, Arno Lehmann wrote:
> Hi,
> 
> 27.11.2008 15:10, Pasi Kärkkäinen wrote:
> > On Thu, Nov 27, 2008 at 08:14:45AM +0100, Arno Lehmann wrote:
> >> Hi,
> >>
> >> 26.11.2008 21:22, Bob Hetzel wrote:
> >>> I've got bacula currently in a hung state with the following interesting 
> >>> info.  When I run a status storage produces the following...
> >> Is your Bacula still stuck? If so, and you have gdb installed, and a 
> >> Bacula with debug symbols, now might be a good time to see what it's 
> >> doing...
> ...
> > I have also seen this lately.. but that was with Bacula 2.5.18.
> > 
> > I could make that hang happen multiple times, but I'm not totally sure what
> > caused that..
> 
> Well, if you can recreate the issue it's worth the effort building 
> Bacula with debug information so you get usable backtraces.
> 
> If the problem happens again, you can use gdb to create a backtrace, 
> showing the developers more details about what happens and thus 
> enabling them to fix the issue.
> 
> I would recommend that now.
> 

And now it's stuck again.. 

Last output in bconsole:

01-Dec 20:01 bacula-sd JobId 4231: Forward spacing Volume "Pool4-Vol-0111" to 
file:block 0:218.
01-Dec 20:04 bacula-sd JobId 4231: Error: block.c:568 Write error at 509:3263 
on device "IBM-LTO3-Drive" (/dev/nst0). ERR=Input/output error.
01-Dec 20:04 bacula-sd JobId 4231: Error: Error writing final EOF to tape. This 
Volume may not be readable. dev.c:1723 ioctl MTWEOF error on "IBM-LTO3-Drive" 
(/dev/nst0). ERR=Input/output error.
01-Dec 20:04 bacula-sd JobId 4231: End of medium on Volume "807NNNL3" 
Bytes=482,782,454,784 Blocks=7,483,606 at 01-Dec-2008 20:04.
01-Dec 20:04 bacula-sd JobId 4231: 3307 Issuing autochanger "unload slot 7, 
drive 0" command.

bconsole is still usable after this..

"sta director" shows a lot of jobs waiting for execution (since this was a
'copy pool uncopied jobs to tape'-job), but nothing happens really.


"sta storage" makes bconsole hang.. last output:

Device status:
Autochanger "IBM-LTO3-AutoChanger" with devices:
   "IBM-LTO3-Drive" (/dev/nst0)
Device "FSDevice0" (/mnt/backup1/pool00) is not open.
Device "FSDevice1" (/mnt/backup1/pool01) is not open.
Device "FSDevice2" (/mnt/backup1/pool02) is not open.
Device "FSDevice3" (/mnt/backup1/pool03) is not open.
Device "FSDevice4" (/mnt/backup1/pool04) is mounted with:
    Volume:      Pool4-Vol-0111
    Pool:        *unknown*
    Media type:  File4
    Total Bytes Read=3,848,656,896 Blocks Read=59,658 Bytes/block=64,512
    Positioned at File=0 Block=3,848,592,601
Device "IBM-LTO3-Drive" (/dev/nst0) is not open.
    Device is being initialized.
    Drive 0 is not loaded.
====

Used Volume status:

<hangs here, have to kill the bconsole>

What kind of backtrace do you want? From which daemon? bacula-sd? 

-- Pasi

-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users