Bacula-users

Re: [Bacula-users] Backup Catalog fatal error

2013-04-09 01:26:54
Subject: Re: [Bacula-users] Backup Catalog fatal error
From: rashid ahmed <rashidkahmed AT gmail DOT com>
To: Marcin Haba <ganiuszka AT gmail DOT com>
Date: Tue, 9 Apr 2013 08:19:34 +0300
Hi,

Many Thanks! for replying as i am new in bacula server.
after typed this command lsscsi -g i got below o/p
[root@smhpmblb00 ~]# lsscsi -g
[0:2:0:0]    disk    IBM      ServeRAID M5014  2.12  /dev/sda   /dev/sg0
[1:0:0:0]    cd/dvd  HL-DT-ST DVDRAM GT30N     IS09  /dev/sr0   /dev/sg1
[5:0:0:0]    disk    IBM      2145             0000  /dev/sdb   /dev/sg2
[5:0:0:1]    disk    IBM      2145             0000  /dev/sdc   /dev/sg3
[5:0:1:0]    disk    IBM      2145             0000  /dev/sdd   /dev/sg4
[5:0:1:1]    disk    IBM      2145             0000  /dev/sde   /dev/sg5
[5:0:2:0]    tape    IBM      ULT3580-TD5      B170  /dev/st0   /dev/sg6
[5:0:2:1]    mediumx IBM      3573-TL          A.40  /dev/sch0  /dev/sg7
[6:0:0:0]    disk    IBM      2145             0000  /dev/sdf   /dev/sg8
[6:0:0:1]    disk    IBM      2145             0000  /dev/sdg   /dev/sg9
[6:0:1:0]    disk    IBM      2145             0000  /dev/sdh   /dev/sg10
[6:0:1:1]    disk    IBM      2145             0000  /dev/sdi   /dev/sg11
[6:0:2:0]    tape    IBM      ULT3580-TD5      B170  /dev/st1   /dev/sg12
[6:0:2:1]    mediumx IBM      3573-TL          A.40  /dev/sch1  /dev/sg13


################################################## /VAR/LOG/MESSAGE/   ################################
Below i got error message on 5th april after that all backup failed, please advise how to fix this issue

 

Apr  5 01:09:46 smhpmblb00 udevd-work[3045]: rename(/dev/disk/by-id/scsi-360050768028081cedc00000000000026.udev-tmp, /dev/disk/by-id/scsi-360050768028081cedc00000000000026) failed: No such file or directory

Apr  5 02:21:25 smhpmblb00 udevd-work[9089]: symlink(../../sdd, /dev/disk/by-id/scsi-360050768028081cedc00000000000026.udev-tmp) failed: File exists

Apr  5 03:38:19 smhpmblb00 udevd-work[15005]: symlink(../../sdc, /dev/disk/by-id/scsi-360050768028081cedc00000000000026.udev-tmp) failed: File exists

Apr  5 03:48:30 smhpmblb00 udevd-work[16423]: rename(/dev/disk/by-id/scsi-360050768028081cedc00000000000027.udev-tmp, /dev/disk/by-id/scsi-360050768028081cedc00000000000027) failed: No such file or directory

Apr  5 03:48:30 smhpmblb00 udevd-work[16434]: rename(/dev/disk/by-id/scsi-360050768028081cedc00000000000027.udev-tmp, /dev/disk/by-id/scsi-360050768028081cedc00000000000027) failed: No such file or directory

Apr  5 03:48:30 smhpmblb00 udevd-work[16044]: rename(/dev/disk/by-id/wwn-0x60050768028081cedc00000000000026.udev-tmp, /dev/disk/by-id/wwn-0x60050768028081cedc00000000000026) failed: No such file or directory

Apr  5 04:24:21 smhpmblb00 udevd-work[19252]: symlink(../../sde, /dev/disk/by-id/scsi-360050768028081cedc00000000000026.udev-tmp) failed: File exists

Apr  5 04:44:53 smhpmblb00 udevd-work[20472]: rename(/dev/disk/by-id/wwn-0x60050768028081cedc00000000000026.udev-tmp, /dev/disk/by-id/wwn-0x60050768028081cedc00000000000026) failed: No such file or directory

Apr  5 05:36:06 smhpmblb00 udevd-work[25031]: symlink(../../sde, /dev/disk/by-id/scsi-360050768028081cedc00000000000026.udev-tmp) failed: File exists

Apr  5 05:41:16 smhpmblb00 udevd-work[25450]: symlink(../../sde, /dev/disk/by-id/scsi-360050768028081cedc00000000000026.udev-tmp) failed: File exists

Apr  5 06:06:47 smhpmblb00 udevd-work[27339]: symlink(../../sdb, /dev/disk/by-id/scsi-360050768028081cedc00000000000026.udev-tmp) failed: File exists

Apr  5 06:53:00 smhpmblb00 udevd-work[31473]: symlink(../../sdd, /dev/disk/by-id/wwn-0x60050768028081cedc00000000000026.udev-tmp) failed: File exists

Apr  5 07:03:21 smhpmblb00 udevd-work[31893]: rename(/dev/disk/by-id/scsi-360050768028081cedc00000000000026.udev-tmp, /dev/disk/by-id/scsi-360050768028081cedc00000000000026) failed: No such file or directory

Apr  5 07:23:53 smhpmblb00 udevd-work[1161]: symlink(../../sdc, /dev/disk/by-id/scsi-360050768028081cedc00000000000026.udev-tmp) failed: File exists

Apr  5 07:39:19 smhpmblb00 udevd-work[3122]: rename(/dev/disk/by-id/scsi-360050768028081cedc00000000000026.udev-tmp, /dev/disk/by-id/scsi-360050768028081cedc00000000000026) failed: No such file or directory

Apr  5 07:54:38 smhpmblb00 udevd-work[4104]: rename(/dev/disk/by-id/scsi-360050768028081cedc00000000000026.udev-tmp, /dev/disk/by-id/scsi-360050768028081cedc00000000000026) failed: No such file or directory

Apr  5 08:04:49 smhpmblb00 udevd-work[5324]: rename(/dev/disk/by-id/scsi-360050768028081cedc00000000000026.udev-tmp, /dev/disk/by-id/scsi-360050768028081cedc00000000000026) failed: No such file or directory

Apr  5 08:15:09 smhpmblb00 udevd-work[6167]: symlink(../../sdd, /dev/disk/by-id/scsi-360050768028081cedc00000000000026.udev-tmp) failed: File exists

Apr  5 08:30:30 smhpmblb00 udevd-work[7371]: rename(/dev/disk/by-id/wwn-0x60050768028081cedc00000000000026.udev-tmp, /dev/disk/by-id/wwn-0x60050768028081cedc00000000000026) failed: No such file or directory

Apr  5 08:35:31 smhpmblb00 udevd-work[7768]: symlink(../../sdd, /dev/disk/by-id/scsi-360050768028081cedc00000000000026.udev-tmp) failed: File exists

Apr  5 08:51:45 smhpmblb00 kernel: st0: Error 70000 (driver bt 0x0, host bt 0x7).

Apr  5 10:05:08 smhpmblb00 kernel: lpfc 0000:1f:00.1: 1:(0):0713 SCSI layer issued Device Reset (2, 0) return x2002

Apr  5 10:06:16 smhpmblb00 kernel: INFO: task multipathd:14739 blocked for more than 120 seconds.

Apr  5 10:06:16 smhpmblb00 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.

Apr  5 10:06:16 smhpmblb00 kernel: multipathd    D 0000000000000006     0 14739      1 0x00000080





On Mon, Apr 8, 2013 at 6:51 PM, Marcin Haba <ganiuszka AT gmail DOT com> wrote:
2013/4/8 rashid ahmed <rashidkahmed AT gmail DOT com>:
> Hi,
>
> Below is error message we are facing in bacula server.
> smhpmblb00-dir shell command: run BeforeJob
> "/etc/bacula/make_catalog_backup.pl MyCatalog"
>
> smhpmblb00-dir Using Device "Drive-1"
>
>  Start Backup JobId 1379, Job=BackupCatalog.2013-04-08_08.36.42_14
>
> smhpmblb00-sd 3304 Issuing autochanger "load slot 1, drive 0" command.
>
>  3991 Bad autochanger "loaded? drive 1" command: ERR=Child exited with code
> 1.
>
> Results=cannot open SCSI device '/dev/sg8' - No such file or directory
>
>  3991 Bad autochanger "loaded? drive 0" command: ERR=Child exited with code
> 1.
>
> Results=cannot open SCSI device '/dev/sg8' - No such file or directory
>
>  3991 Bad autochanger "loaded? drive 0" command: ERR=Child exited with code
> 1.
>
> Results=cannot open SCSI device '/dev/sg8' - No such file or directory
>
>  3991 Bad autochanger "loaded? drive 0" command: ERR=Child exited with code
> 1.
>
> Results=cannot open SCSI device '/dev/sg8' - No such file or directory
>
> smhpmblb00-dir
>
> Error: Bacula smhpmblb00-dir 5.2.12 (12Sep12):
>
>   Build OS:               x86_64-unknown-linux-gnu redhat Enterprise release
>
>   JobId:                  1379
>
>   Job:                    BackupCatalog.2013-04-08_08.36.42_14
>
>   Backup Level:           Full
>
>   Client:                 "SMC-BACKUP-Machine-fd" 5.2.12 (12Sep12)
> x86_64-unknown-linux-gnu,redhat,Enterprise release
>
>   FileSet:                "Catalog" 2012-12-12 07:00:00
>
>   Pool:                   "Daily" (From Job resource)
>
>   Catalog:                "MyCatalog" (From Pool resource)
>
>   Storage:                "TS3200" (From command line)
>
>   Scheduled time:         08-Apr-2013 08:35:43
>
>   Start time:             08-Apr-2013 08:37:39
>
>   End time:               08-Apr-2013 08:42:39
>
>   Elapsed time:           5 mins
>
>   Priority:               10
>
>   FD Files Written:       0
>
>   SD Files Written:       0
>
>   FD Bytes Written:       0 (0 B)
>
>   SD Bytes Written:       0 (0 B)
>
>   Rate:                   0.0 KB/s
>
>   Software Compression:   None
>
>   VSS:                    no
>
>   Encryption:             no
>
>   Accurate:               no
>
>   Volume name(s):
>
>   Volume Session Id:      96
>
>   Volume Session Time:    1364803824
>
>   Last Volume Bytes:      4,615,028,296,704 (4.615 TB)
>
>   Non-fatal FD errors:    1
>
>   SD Errors:              1
>
>   FD termination status:  Error
>
>   SD termination status:  Error
>
>   Termination:            *** Backup Error ***
>
> smhpmblb00-fd Fatal error: job.c:2395 Bad response to Append Data command.
> Wanted 3000 OK data
>
> , got 3903 Error append data
>
> smhpmblb00-sd Fatal error: 3992 Bad autochanger "load slot 1, dr
> ive 0": ERR=Child died from signal 15: Termination.
>
> Results=Program killed by Bacula (timeout)
>
> If anyone knows the solution please let me know as soon as possible.

Hi,

It seems that your autochanger has another a SCSI generic special file
(/dev/sg8) than is expected by Bacula Storage Daemon. It may be caused
through:
1. Changes on SCSI bus (eg. connected new additional device)
2. Wrong configuration of the Bacula Storage Daemon.

You should correct your bacula-sd.conf to current state of your
system. I mean your /dev/sg8 definition. You can check special SCSI
generic file for your autochanger eg. by next command:

lsscsi -g

Better solution is creating UDEV rules for your autochanger. In this
case your autochanger  and tape drives will be always fixed names
defined by you.

Regards.
Marcin Haba (gani)


--
"Większej miłości nikt nie ma nad tę, jak gdy kto życie swoje kładzie
za przyjaciół swoich." Jezus Chrystus

------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users
<Prev in Thread] Current Thread [Next in Thread>