Bacula-users

Re: [Bacula-users] bacula hang waiting for storage

2008-12-02 15:24:25
Subject: Re: [Bacula-users] bacula hang waiting for storage
From: Pasi Kärkkäinen <pasik AT iki DOT fi>
To: Julien Cigar <jcigar AT ulb.ac DOT be>
Date: Tue, 2 Dec 2008 22:21:16 +0200
On Tue, Dec 02, 2008 at 06:56:34PM +0100, Julien Cigar wrote:
> Yes the SCSI card is an Adaptec (I replaced it today with a QSI Logic to
> see if I have better results).
> The OS is FreeBSD 7.0-p6 (32 bits) with the ahc driver.
> What's strange is that I can write many jobs without any problems, but
> then it suddenly fails, always with the same error (Error writing final
> EOF to tape.)
> I posted a message on the freebsd-scsi mailing list some days ago, but I
> didn't get any answer :
> http://lists.freebsd.org/pipermail/freebsd-scsi/2008-November/003706.html
> I'm less and less sure that it's a driver/OS issue, but rather a Bacula
> bug (but I could be wrong).
>

Thanks for the info.

Could this be related?:

"[Bacula-users] FreeBSD, Bacula, and a Dell Autochanger 122T SCSI Timeouts"
http://sourceforge.net/mailarchive/forum.php?thread_name=1228142720.2805.223.camel%40soundwave.ws.pitbpa0.priv.collaborativefusion.com&forum_name=bacula-users

-- Pasi

 
> Best regards,
> Julien
> 
> On Tue, 2008-12-02 at 18:18 +0200, Pasi Kärkkäinen wrote:
> > On Tue, Dec 02, 2008 at 06:11:11PM +0200, Pasi Kärkkäinen wrote:
> > > On Tue, Dec 02, 2008 at 03:56:33PM +0100, Julien Cigar wrote:
> > > > Same problem here with a Sony SDX-700C
> > > > 
> > > 
> > > Thanks for the report. Do you also have Adaptec SCSI HBA?
> > > 
> > 
> > And which OS?
> > 
> > I'm running CentOS 5.2 x86 32bit. 
> > 
> > -- Pasi
> > 
> > > 
> > > > On Tue, 2008-12-02 at 15:30 +0100, Nils Blanck-Wehde wrote:
> > > > > Hi!
> > > > > 
> > > > > Just wanted to let you know that I came across the exact same error 
> > > > > Error writing final EOF to tape. This Volume may not be readable.
> > > > > a couple of times with 2.4.2 using a Quantum DLT VS1 drive connected
> > > > > to Adaptec 29160LP. I don't think that the tape is really defective as
> > > > > bacula states. I could do working backups on these tapes later. Maybe
> > > > > its a problem with positioning (forwarding) the tape to the right
> > > > > position?
> > > > > If there is still interest in this issue I might search for the
> > > > > corresponding job-output.
> > > > > 
> > > > > Nils
> > > > > 
> > > > > Pasi Kärkkäinen schrieb: 
> > > > > > On Thu, Nov 27, 2008 at 05:53:41PM +0100, Arno Lehmann wrote:
> > > > > >   
> > > > > > > Hi,
> > > > > > > 
> > > > > > > 27.11.2008 15:10, Pasi Kärkkäinen wrote:
> > > > > > >     
> > > > > > > > On Thu, Nov 27, 2008 at 08:14:45AM +0100, Arno Lehmann wrote:
> > > > > > > >       
> > > > > > > > > Hi,
> > > > > > > > > 
> > > > > > > > > 26.11.2008 21:22, Bob Hetzel wrote:
> > > > > > > > >         
> > > > > > > > > > I've got bacula currently in a hung state with the 
> > > > > > > > > > following interesting 
> > > > > > > > > > info.  When I run a status storage produces the following...
> > > > > > > > > >           
> > > > > > > > > Is your Bacula still stuck? If so, and you have gdb 
> > > > > > > > > installed, and a 
> > > > > > > > > Bacula with debug symbols, now might be a good time to see 
> > > > > > > > > what it's 
> > > > > > > > > doing...
> > > > > > > > >         
> > > > > > > ...
> > > > > > >     
> > > > > > > > I have also seen this lately.. but that was with Bacula 2.5.18.
> > > > > > > > 
> > > > > > > > I could make that hang happen multiple times, but I'm not 
> > > > > > > > totally sure what
> > > > > > > > caused that..
> > > > > > > >       
> > > > > > > Well, if you can recreate the issue it's worth the effort 
> > > > > > > building 
> > > > > > > Bacula with debug information so you get usable backtraces.
> > > > > > > 
> > > > > > > If the problem happens again, you can use gdb to create a 
> > > > > > > backtrace, 
> > > > > > > showing the developers more details about what happens and thus 
> > > > > > > enabling them to fix the issue.
> > > > > > > 
> > > > > > > I would recommend that now.
> > > > > > > 
> > > > > > >     
> > > > > > 
> > > > > > And now it's stuck again.. 
> > > > > > 
> > > > > > Last output in bconsole:
> > > > > > 
> > > > > > 01-Dec 20:01 bacula-sd JobId 4231: Forward spacing Volume 
> > > > > > "Pool4-Vol-0111" to file:block 0:218.
> > > > > > 01-Dec 20:04 bacula-sd JobId 4231: Error: block.c:568 Write error 
> > > > > > at 509:3263 on device "IBM-LTO3-Drive" (/dev/nst0). 
> > > > > > ERR=Input/output error.
> > > > > > 01-Dec 20:04 bacula-sd JobId 4231: Error: Error writing final EOF 
> > > > > > to tape. This Volume may not be readable. dev.c:1723 ioctl MTWEOF 
> > > > > > error on "IBM-LTO3-Drive" (/dev/nst0). ERR=Input/output error.
> > > > > > 01-Dec 20:04 bacula-sd JobId 4231: End of medium on Volume 
> > > > > > "807NNNL3" Bytes=482,782,454,784 Blocks=7,483,606 at 01-Dec-2008 
> > > > > > 20:04.
> > > > > > 01-Dec 20:04 bacula-sd JobId 4231: 3307 Issuing autochanger "unload 
> > > > > > slot 7, drive 0" command.
> > > > > > 
> > > > > > bconsole is still usable after this..
> > > > > > 
> > > > > > "sta director" shows a lot of jobs waiting for execution (since 
> > > > > > this was a
> > > > > > 'copy pool uncopied jobs to tape'-job), but nothing happens really.
> > > > > > 
> > > > > > 
> > > > > > "sta storage" makes bconsole hang.. last output:
> > > > > > 
> > > > > > Device status:
> > > > > > Autochanger "IBM-LTO3-AutoChanger" with devices:
> > > > > >    "IBM-LTO3-Drive" (/dev/nst0)
> > > > > > Device "FSDevice0" (/mnt/backup1/pool00) is not open.
> > > > > > Device "FSDevice1" (/mnt/backup1/pool01) is not open.
> > > > > > Device "FSDevice2" (/mnt/backup1/pool02) is not open.
> > > > > > Device "FSDevice3" (/mnt/backup1/pool03) is not open.
> > > > > > Device "FSDevice4" (/mnt/backup1/pool04) is mounted with:
> > > > > >     Volume:      Pool4-Vol-0111
> > > > > >     Pool:        *unknown*
> > > > > >     Media type:  File4
> > > > > >     Total Bytes Read=3,848,656,896 Blocks Read=59,658 
> > > > > > Bytes/block=64,512
> > > > > >     Positioned at File=0 Block=3,848,592,601
> > > > > > Device "IBM-LTO3-Drive" (/dev/nst0) is not open.
> > > > > >     Device is being initialized.
> > > > > >     Drive 0 is not loaded.
> > > > > > ====
> > > > > > 
> > > > > > Used Volume status:
> > > > > > 
> > > > > > <hangs here, have to kill the bconsole>
> > > > > > 
> > > > > > What kind of backtrace do you want? From which daemon? bacula-sd? 
> > > > > > 
> > > > > > -- Pasi
> > > > > >   
> 

-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users