Re: [Bacula-users] bacula hang waiting for storage

2008-12-02 09:51:51
Subject: Re: [Bacula-users] bacula hang waiting for storage
From: Pasi Kärkkäinen <pasik AT iki DOT fi>
To: Nils Blanck-Wehde <nils.blanck-wehde AT backofficeservice DOT biz>
Date: Tue, 2 Dec 2008 16:48:24 +0200
On Tue, Dec 02, 2008 at 03:30:06PM +0100, Nils Blanck-Wehde wrote:
> Hi!
> 
> Just wanted to let you know that I came across the exact same error
> 
> Error writing final EOF to tape. This Volume may not be readable.
> 
> a couple of times with 2.4.2 using a Quantum DLT VS1 drive connected to
> Adaptec 29160LP. I don't think that the tape is really defective as
> Bacula states. I could make working backups on these tapes later. Maybe
> it's a problem with positioning (forwarding) the tape to the right position?
> If there is still interest in this issue I might search for the
> corresponding job-output.
> 

I also think the tapes themselves are fine: this has happened many times
now, and I doubt all of the tapes are bad.

I'm also using an Adaptec 29160 SCSI HBA to connect to the tape library.

I wonder what would be the best way to debug this.. 
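
One option, following Arno's suggestion quoted below, would be to attach gdb
to the hung daemon and dump a backtrace of every thread. A minimal sketch,
assuming gdb is installed, the script runs as a user allowed to attach to the
process (typically root), and the daemon was built with debug symbols so the
output is readable. The function names are my own and the PID has to be looked
up separately (for example with ps):

```python
import subprocess


def gdb_backtrace_cmd(pid):
    """Build a gdb command line that attaches to `pid`, prints a
    backtrace of all threads, and then exits (detaching again)."""
    return ["gdb", "-p", str(pid), "-batch", "-ex", "thread apply all bt"]


def dump_backtrace(pid, outfile):
    """Run gdb against `pid` and save its output to `outfile`.

    Hypothetical helper: nothing here is Bacula-specific, it only
    wraps the gdb invocation built above.
    """
    result = subprocess.run(gdb_backtrace_cmd(pid),
                            capture_output=True, text=True)
    with open(outfile, "w") as fh:
        fh.write(result.stdout)
        fh.write(result.stderr)
    return result.returncode
```

Something like dump_backtrace(pid_of_bacula_sd, "bacula-sd.bt") while the
daemon is stuck should give a file that can be attached to a bug report.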

Now, after Bacula hung the first time, I'm not able to get it running again:
it always hangs when I run "sta storage", and nothing actually happens.

I guess the tape drive/library is in some bad state? Or the SCSI driver? 

Let's see if rebooting the server gets it running again..

-- Pasi

> Nils
> 
> Pasi Kärkkäinen wrote:
> > On Thu, Nov 27, 2008 at 05:53:41PM +0100, Arno Lehmann wrote:
> >   
> >> Hi,
> >>
> >> 27.11.2008 15:10, Pasi Kärkkäinen wrote:
> >>     
> >>> On Thu, Nov 27, 2008 at 08:14:45AM +0100, Arno Lehmann wrote:
> >>>       
> >>>> Hi,
> >>>>
> >>>> 26.11.2008 21:22, Bob Hetzel wrote:
> >>>>         
> >>>>> I've got Bacula currently in a hung state with the following interesting
> >>>>> info.  When I run a status storage, it produces the following...
> >>>>>           
> >>>> Is your Bacula still stuck? If so, and you have gdb installed, and a 
> >>>> Bacula with debug symbols, now might be a good time to see what it's 
> >>>> doing...
> >>>>         
> >> ...
> >>     
> >>> I have also seen this lately.. but that was with Bacula 2.5.18.
> >>>
> >>> I could make that hang happen multiple times, but I'm not totally sure what
> >>> caused that..
> >>>       
> >> Well, if you can recreate the issue it's worth the effort building 
> >> Bacula with debug information so you get usable backtraces.
> >>
> >> If the problem happens again, you can use gdb to create a backtrace, 
> >> showing the developers more details about what happens and thus 
> >> enabling them to fix the issue.
> >>
> >> I would recommend that now.
> >>
> >>     
> >
> > And now it's stuck again.. 
> >
> > Last output in bconsole:
> >
> > 01-Dec 20:01 bacula-sd JobId 4231: Forward spacing Volume "Pool4-Vol-0111" to file:block 0:218.
> > 01-Dec 20:04 bacula-sd JobId 4231: Error: block.c:568 Write error at 509:3263 on device "IBM-LTO3-Drive" (/dev/nst0). ERR=Input/output error.
> > 01-Dec 20:04 bacula-sd JobId 4231: Error: Error writing final EOF to tape. This Volume may not be readable. dev.c:1723 ioctl MTWEOF error on "IBM-LTO3-Drive" (/dev/nst0). ERR=Input/output error.
> > 01-Dec 20:04 bacula-sd JobId 4231: End of medium on Volume "807NNNL3" Bytes=482,782,454,784 Blocks=7,483,606 at 01-Dec-2008 20:04.
> > 01-Dec 20:04 bacula-sd JobId 4231: 3307 Issuing autochanger "unload slot 7, drive 0" command.
> >
> > bconsole is still usable after this..
> >
> > "sta director" shows a lot of jobs waiting for execution (since this was a
> > 'copy pool uncopied jobs to tape' job), but nothing actually happens.
> >
> >
> > "sta storage" makes bconsole hang.. last output:
> >
> > Device status:
> > Autochanger "IBM-LTO3-AutoChanger" with devices:
> >    "IBM-LTO3-Drive" (/dev/nst0)
> > Device "FSDevice0" (/mnt/backup1/pool00) is not open.
> > Device "FSDevice1" (/mnt/backup1/pool01) is not open.
> > Device "FSDevice2" (/mnt/backup1/pool02) is not open.
> > Device "FSDevice3" (/mnt/backup1/pool03) is not open.
> > Device "FSDevice4" (/mnt/backup1/pool04) is mounted with:
> >     Volume:      Pool4-Vol-0111
> >     Pool:        *unknown*
> >     Media type:  File4
> >     Total Bytes Read=3,848,656,896 Blocks Read=59,658 Bytes/block=64,512
> >     Positioned at File=0 Block=3,848,592,601
> > Device "IBM-LTO3-Drive" (/dev/nst0) is not open.
> >     Device is being initialized.
> >     Drive 0 is not loaded.
> > ====
> >
> > Used Volume status:
> >
> > <hangs here, have to kill the bconsole>
> >
> > What kind of backtrace do you want? From which daemon? bacula-sd? 
> >
> > -- Pasi
> >   
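
For a bug report it may help to pull just the error messages out of a saved
job log. A rough sketch, assuming the log has one message per line in the same
"DD-Mon HH:MM daemon JobId N: Error: ..." layout as the bconsole output quoted
above. The pattern and function name are my own, not anything shipped with
Bacula:

```python
import re

# Matches lines such as:
# 01-Dec 20:04 bacula-sd JobId 4231: Error: block.c:568 Write error ...
ERROR_LINE = re.compile(
    r"^(?P<when>\d{2}-\w{3} \d{2}:\d{2}) "
    r"(?P<daemon>\S+) JobId (?P<jobid>\d+): Error: (?P<msg>.*)$"
)


def error_lines(lines):
    """Return (when, daemon, jobid, message) tuples for every line
    that carries an 'Error:' message; other lines are skipped."""
    hits = []
    for line in lines:
        m = ERROR_LINE.match(line.strip())
        if m:
            hits.append((m.group("when"), m.group("daemon"),
                         int(m.group("jobid")), m.group("msg")))
    return hits
```

Feeding it the lines of a job log (for example via open(path).readlines())
returns only the error entries, with the job id as an integer.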
