Bacula-users

[Bacula-users] MSL6030: added a second tape reader -> problem

2010-12-01 08:47:27
Subject: [Bacula-users] MSL6030: added a second tape reader -> problem
From: Simone Martina <smartina AT noc.skylogicnet DOT com>
To: bacula-users <bacula-users AT lists.sourceforge DOT net>
Date: Wed, 01 Dec 2010 14:25:46 +0100
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi at all,
yesterday I've added a second LTO-4 ULTRIUM 1640 tape reader into my HP
MSL6030 SCSI tape library. I'm using Bacula 5.0.2 on Centos 5 x64.
I've updated my config in this way:

## bacula-sd.conf
Autochanger {
  Name = "Autochanger"
  Device = Drive-1, Drive-2
  Changer Device = /dev/sg5
  Changer Command = "/etc/bacula/mtx-changer %c %o %S %a %d"
}
Device {
  Name = Drive-1
  Drive Index = 0
  Media Type = LTO-4
  Archive Device = /dev/nst1
  AutomaticMount = yes;               # when device opened, read it
  AlwaysOpen = yes;
  RemovableMedia = yes;
  RandomAccess = no;
#  Maximum File Size = 5GB
 Changer Command = "/etc/bacula/mtx-changer %c %o %S %a %d"
 Changer Device = /dev/sg5
 AutoChanger = yes
  Spool Directory = /srv/baculaspool
  Maximum Spool Size = 50G
# Enable the Alert command only if you have the mtx package loaded
Alert Command = "sh -c 'tapeinfo -f %c |grep TapeAlert|cat'"
# If you have smartctl, enable this, it has more info than tapeinfo
Alert Command = "sh -c 'smartctl -H -l error %c'"
}
Device {
  Name = Drive-2
  Drive Index = 1
  Media Type = LTO-4
  Archive Device = /dev/nst0
  AutomaticMount = yes;               # when device opened, read it
  AlwaysOpen = yes;
  RemovableMedia = yes;
  RandomAccess = no;
#  Maximum File Size = 5GB
 Changer Command = "/etc/bacula/mtx-changer %c %o %S %a %d"
 Changer Device = /dev/sg5
 AutoChanger = yes
  Spool Directory = /srv/baculaspool
  Maximum Spool Size = 50G
# Enable the Alert command only if you have the mtx package loaded
Alert Command = "sh -c 'tapeinfo -f %c |grep TapeAlert|cat'"
# If you have smartctl, enable this, it has more info than tapeinfo
Alert Command = "sh -c 'smartctl -H -l error %c'"
}

##bacula-dir.conf (storage section):
# LTO-4 Storage
Storage {
  Name = TAPE
  Address = backup2.lan
  SDPort = 9103
  Password = "sdpassword"
  Device = Drive-1
  Media Type = LTO-4
  Maximum Concurrent Jobs = 20
}

# LTO-4 Storage 2nd drive
Storage {
  Name = TAPE2
  Address = backup2.lan
  SDPort = 9103
  Password = "sdpassword"
  Device = Drive-2
  Media Type = LTO-4
  Maximum Concurrent Jobs = 20
}

# MSL6030 Autochanger
Storage {
  Name = MSL6000
  Address = backup2.lan
  SDPort = 9103
  Password = "sdpassword"
  Device = Autochanger
  Media Type = LTO-4
  Autochanger = Yes
  Maximum Concurrent Jobs = 20
}

The old drive is Drive-1, meanwhile the new one is Drive-2. I've
connected both the library (/dev/sg5) than the two reader to the same
SCSI chain using 3 SCSI cables and an ending SCSI terminator.
The library is happy, but when I've tried to run a backup job, Bacula
choices the first tape (the old one): after 2 or 3 megabytes the job
fails with this messages:

01-Dec 12:41 backup2.lan-sd JobId 2032: Wrote label to prelabeled Volume
"NN6036L4" on device "Drive-1" (/dev/nst1)
01-Dec 12:41 backup2.lan-sd JobId 2032: Error: block.c:577 Write error
at 0:12 on device "Drive-1" (/dev/nst1). ERR=Input/output error.
01-Dec 12:42 backup2.lan-sd JobId 2032: Error: block.c:318 Volume data
error at 0:4294967295!
Block checksum mismatch in block=11 len=64512: calc=669ee9d3 blk=a7b18121
01-Dec 12:42 backup2.lan-sd JobId 2032: Error: Re-read last block at EOT
failed. ERR=block.c:318 Volume data error at 0:4294967295!
Block checksum mismatch in block=11 len=64512: calc=669ee9d3 blk=a7b18121

and tries to load a new tape from the same pool (incremental pool).
After used any of available tapes, Bacula request a new one but fails again.

Now I've cancelled all jobs and, after erased a tape, I've tried to run
a manual backup jobs forcing use the new tape reader: this jobs ends
successful!
Till yesterday morning I've always use the old reader without troubles,
but now I got this error into dmesg:

st1: Current: sense key: Aborted Command
    Add. Sense: Information unit iuCRC error detected

INFO: task mt:19730 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
mt            D ffff810001587120     0 19730   7392
(NOTLB)
 ffff810027741cd8 0000000000000082 0000000000000001 ffff810020ed2d68
 ffff81003efa5928 0000000000000007 ffff81003c138080 ffff81003ff3f080
 00000a275d268df9 0000000000003627 ffff81003c138268 000000028807aa5a
Call Trace:
 [<ffffffff80063167>] wait_for_completion+0x79/0xa2
 [<ffffffff8008cfa1>] default_wake_function+0x0/0xe
 [<ffffffff88293e85>] :st:st_do_scsi+0x1f4/0x221
 [<ffffffff88294994>] :st:st_int_ioctl+0x5f2/0x92b
 [<ffffffff80008d56>] __handle_mm_fault+0x5f3/0x1039
 [<ffffffff88296aba>] :st:st_ioctl+0xaa5/0xe1f
 [<ffffffff80066b88>] do_page_fault+0x4fe/0x874
 [<ffffffff80042181>] do_ioctl+0x55/0x6b
 [<ffffffff80030204>] vfs_ioctl+0x457/0x4b9
 [<ffffffff800b7605>] audit_syscall_entry+0x180/0x1b3
 [<ffffffff8004c633>] sys_ioctl+0x59/0x78
 [<ffffffff8005d28d>] tracesys+0xd5/0xe0

Any suggestions?

Thanks,

Simone
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iQEcBAEBAgAGBQJM9kzaAAoJEC9yUzzw9wFDjzsIAJBFXZ0iruR8tmksNzENBqJ1
2XnFLdK8Mcqt4s0udfhgqT5E/HAIVIeg3Yd8y4761Tw9HIfrczZxayM5C+PWWFMW
+w6oFAUrm2uSnBXr1t39m5Ye0Mz5dAlBVqs1/vUl7YB4d/uBCHQPi4qPp+S1/klU
FZS6CLEOoHUmGFCjU2he81f5QdmDcgGnXTKuR2of7Mpm3lh3WUuyYNhfmZ7d6oz9
BxNCQaEnysMypod2x3/qtOkiMTKdy5PA3hAPYllMQjpbtKabTB+PEXOyJt8zBYP7
Hv28Zew0pnO8moyx5yKFein07XOL9uRHSTS2ClrPOGNBD8Rrmx0JGw3YN5QBdiA=
=RwwE
-----END PGP SIGNATURE-----

------------------------------------------------------------------------------
Increase Visibility of Your 3D Game App & Earn a Chance To Win $500!
Tap into the largest installed PC base & get more eyes on your game by
optimizing for Intel(R) Graphics Technology. Get started today with the
Intel(R) Software Partner Program. Five $500 cash prizes are up for grabs.
http://p.sf.net/sfu/intelisp-dev2dev
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users

<Prev in Thread] Current Thread [Next in Thread>
  • [Bacula-users] MSL6030: added a second tape reader -> problem, Simone Martina <=