Bacula-users

Re: [Bacula-users] bacula watchdog killing tape-loading

2008-08-07 06:49:39
Subject: Re: [Bacula-users] bacula watchdog killing tape-loading
From: Nils Blanck-Wehde <nils.blanck-wehde AT backofficeservice DOT biz>
Date: Thu, 07 Aug 2008 12:49:14 +0200
Hi John,

thanks for your help. I am quite new to bacula and it seems to take some time to fully understand it :-)
I am not sure whether it really is a timeout problem.
I increased all timeout values to 10 minutes and the killing still occurs (after 2:20min):

07-Aug 12:20 company_bacula-dir JobId 217: Start Backup JobId 217, Job=Fileserver_Lexware_Exchange_to_Tape.2008-08-07_12.20.03
07-Aug 12:20 company_bacula-dir JobId 217: Using Device "DLT-Drive-1"
07-Aug 12:20 company_bacula-sd JobId 217: 3301 Issuing autochanger "loaded? drive 0" command.
07-Aug 12:20 company_bacula-sd JobId 217: 3302 Autochanger "loaded? drive 0", result: nothing loaded.
07-Aug 12:20 company_bacula-sd JobId 217: 3304 Issuing autochanger "load slot 8, drive 0" command.
*messages
07-Aug 12:22 company_bacula-sd JobId 217: Fatal error: 3992 Bad autochanger "load slot 8, drive 0": ERR=Child died from signal 15: Termination.
Results=Loading media from Storage Element 8 into drive 0...done
Program killed by Bacula watchdog (timeout)

07-Aug 12:20 company-appsrv-fd JobId 217: Fatal error: ../../filed/job.c:1817 Bad response to Append Data command. Wanted 3000 OK data, got 3903 Error append data


I doublechecked the time needed by the autochanger for an unload/load operation: I did a couple of operations and they all terminated in less than four minutes.
I think there is a general problem with the autochanger because btape test throws the following error when testing the autochanger:

3301 Issuing autochanger "loaded" command.
3991 Bad autochanger command: /usr/lib/bacula/mtx-changer /dev/sg3 loaded 1 /dev/nst0 0
3991 result="": ERR=Child died from signal 15: Termination
You must correct this error or the Autochanger will not work.

When I run this command "/usr/lib/bacula/mtx-changer /dev/sg3 loaded 1 /dev/nst0 0" manually, both as root and as user bacula, it returns "1":

[root@company-Backupserver ~]# su - bacula
-bash-3.2$ /usr/lib/bacula/mtx-changer /dev/sg3 loaded 1 /dev/nst0 0
1
-bash-3.2$


Could it be a permission problem? What Uid Gid should mtx-changer have? Could some automatic mechanism on my CentOS installation have changed the permissions of the autochanger device to a lower level?

I am a little stuck here.

Thanks for all help!

Nils


John Drescher schrieb:
On Wed, Aug 6, 2008 at 1:42 PM, Nils Blanck-Wehde
<nils.blanck-wehde AT backofficeservice DOT biz> wrote:
  
Hello list,

I just encountered this issue:

06-Aug 19:31 company_bacula-sd JobId 210: Fatal error: 3992 Bad
autochanger "load slot 8, drive 0": ERR=Child died from signal 15:
Termination.
Results=Loading media from Storage Element 8 into drive 0...done
Program killed by Bacula watchdog (timeout)


Earlier today I got this message:

06-Aug 17:10 company_bacula-dir JobId 208: Using Device "DLT-Drive-1"
06-Aug 17:10 company_bacula-sd JobId 208: 3301 Issuing autochanger "loaded? drive 0" command.
06-Aug 17:10 company_bacula-sd JobId 208: 3302 Autochanger "loaded? drive 0", result: nothing loaded.
06-Aug 17:10 company_bacula-sd JobId 208: 3304 Issuing autochanger "load slot 1, drive 0" command.
06-Aug 17:13 company-appsrv-fd JobId 208: Fatal error: ../../filed/job.c:1817 Bad response to Append Data command. Wanted 3000 OK data
, got 3903 Error append data

06-Aug 17:15 company_bacula-sd JobId 208: Fatal error: 3992 Bad autochanger "load slot 1, drive 0": ERR=Child died from signal 15: Termination.
Results=Loading media from Storage Element 1 into drive 0...done
Program killed by Bacula watchdog (timeout)

    
The default maximum changer wait is 5 minutes. If the changer does not
complete in 5 minutes bacula will kill the mtx-changer script.

See Maximum Changer Wait in

http://bacula.org/en/rel-manual/Storage_Daemon_Configuratio.html

John


!DSPAM:4899e40991086581397638!

  

--

 

BackOfficeService - Beratung und Service für Ihre IT -

 

Anschrift:

Niederkastenholzer Str. 40

53881 Euskirchen

Telefon:   +49 2255 953204

Fax:          +49 2255 953208

Mobil:      +49 177 3397547

Bankverbindung:

Raiffeisenbank Rheinbach Voreifel eG

Kto. Nr. 340286014 (BLZ 370 696 27)

Online:

info@backofficeservice.biz

www.backofficeservice.biz

 

-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users