Bacula-users

Re: [Bacula-users] bacula-sd hanging after tape gets full + unload

2008-12-11 08:02:59
Subject: Re: [Bacula-users] bacula-sd hanging after tape gets full + unload
From: Pasi Kärkkäinen <pasik AT iki DOT fi>
To: bacula-users AT lists.sourceforge DOT net
Date: Thu, 11 Dec 2008 13:58:00 +0200
On Thu, Nov 13, 2008 at 02:26:20PM +0200, Pasi Kärkkäinen wrote:
> Hello list!
> 
> I'm using Bacula 2.5.19 and trying 'copy jobs' feature to copy jobs from
> disk volumes/pools to tape.
> 
> Sometimes bacula-sd seems to get stuck.. it hangs without doing anything.
> Now it happened when tape got full and Bacula started to change the tape on
> the drive (using autoloader):
> 
> bacula-sd JobId 3082: Start Copying JobId 3082, 
> Job=CopyPool4UncopiedToTape.2008-11-13_10.53.04.54
> bacula-sd JobId 3082: Using Device "IBM-LTO3-Drive"
> bacula-sd JobId 3082: Ready to read from volume "Pool4-Vol-0127" on device 
> "FSDevice4" (/mnt/backup1/pool04).
> bacula-sd JobId 3082: Forward spacing Volume "Pool4-Vol-0127" to file:block 
> 0:218.
> bacula-sd JobId 3082: End of Volume "756NNNL3" at 764:10067 on device 
> "IBM-LTO3-Drive" (/dev/nst0). Write of 64512 bytes got -1.
> bacula-sd JobId 3082: Re-read of last block succeeded.
> bacula-sd JobId 3082: End of medium on Volume "756NNNL3" 
> Bytes=725,237,130,240 Blocks=11,241,894 at 13-Nov-2008 11:51.
> bacula-sd JobId 3082: 3307 Issuing autochanger "unload slot 3, drive 0" 
> command.
> 
> <nothing happens after this>
> 
> 
> *sta
> Status available for:
>      1: Director
>      2: Storage
>      3: Client
>      4: All
> Select daemon type for status (1-4): 2
> 
> ...
> 
> Device status:
> Autochanger "IBM-LTO3-AutoChanger" with devices:
>    "IBM-LTO3-Drive" (/dev/nst0)
> Device "FSDevice0" (/mnt/backup1/pool00) is not open.
> Device "FSDevice1" (/mnt/backup1/pool01) is not open.
> Device "FSDevice2" (/mnt/backup1/pool02) is not open.
> Device "FSDevice3" (/mnt/backup1/pool03) is not open.
> Device "FSDevice4" (/mnt/backup1/pool04) is mounted with:
>     Volume:      Pool4-Vol-0127
>     Pool:        Pool4
>     Media type:  File4
>     Total Bytes Read=1,649,507,328 Blocks Read=25,569 Bytes/block=64,512
>     Positioned at File=0 Block=1,649,507,534
> Device "IBM-LTO3-Drive" (/dev/nst0) is not open.
>     Device is being initialized.
>     Drive 0 is not loaded.
> ====
> 
> Used Volume status:
> 
> <hangs here and nothing happens>
> 
> 
> I can exit bconsole by pressing CTRL+C multiple times.. if I restart
> bconsole and run that again, it gets stuck again.. 
> 
> I tried 'strace -p <pid>' to see what bacula-sd is doing:
> 
> # strace -p 7339
> Process 7339 attached - interrupt to quit
> select(5, [4], NULL, NULL, NULL <unfinished ...>
> Process 7339 detached
> 
> So.. bacula-sd seems to be stuck on select() .. 
> 
> Running 'mtx' seems to work fine.. at the same time when bacula-sd is stuck.
> 
> # mtx -f /dev/sg3 status
>   Storage Changer /dev/sg3:1 Drives, 8 Slots ( 0 Import/Export )
> Data Transfer Element 0:Empty
>       Storage Element 1:Full :VolumeTag=179MMML3
>       Storage Element 2:Full :VolumeTag=658NNNL3
>       Storage Element 3:Full :VolumeTag=756NNNL3
>       Storage Element 4:Full :VolumeTag=177MMML3
>       Storage Element 5:Full :VolumeTag=655NNNL3
>       Storage Element 6:Full :VolumeTag=656NNNL3
>       Storage Element 7:Full :VolumeTag=657NNNL3
>       Storage Element 8:Full :VolumeTag=CLNU38L1
> 
> 
> Any ideas how to fix this? Other than restarting Bacula.. 
> 
> I don't see any IO errors in dmesg and/or messages.
> 

Replying myself.. this was a bug in Bacula 2.5 SVN version, and it can be
fixed with this patch:

http://www.mail-archive.com/bacula-devel AT lists.sourceforge DOT 
net/msg03646.html

-- Pasi

------------------------------------------------------------------------------
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users

<Prev in Thread] Current Thread [Next in Thread>
  • Re: [Bacula-users] bacula-sd hanging after tape gets full + unload, Pasi Kärkkäinen <=