Hi all,
I'm sure this is the place for asking this :)
The configuration:
Hardware: SunFire 6800
Robot: ADIC Scalar 100 with two drives, 50 slots connected via SCSI.
OS: Solaris 8
Storage: Clariion CX600
Netbackup 4.5GA
Everytime the backup starts, I get this errors and then both drives go DOWN, no
more backups are possible.
--
Jan 24 06:47:24 foo.com tl8d[6024]: [ID 182334 daemon.error] TL8(0) going to UP
state
Jan 24 07:07:39 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 07:07:39 foo.com SCSI transport failed: reason 'timeout': giving
up
Jan 24 07:07:39 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@1,0 (st242):
Jan 24 07:07:39 foo.com SCSI transport failed: reason 'reset': retrying
command
Jan 24 07:07:42 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@1,0 (st242):
Jan 24 07:07:42 foo.com Error for Command: rezero/rewind
Error Level: Fatal
Jan 24 07:07:42 foo.com scsi: [ID 107833 kern.notice] Requested Block: 1
Error Block: 1
Jan 24 07:07:42 foo.com scsi: [ID 107833 kern.notice] Vendor: SONY
Serial Number:
Jan 24 07:07:42 foo.com scsi: [ID 107833 kern.notice] Sense Key: Unit
Attention
Jan 24 07:07:42 foo.com scsi: [ID 107833 kern.notice] ASC: 0x29 (power on,
reset, or bus reset occurred), ASCQ: 0x0, FRU: 0x0
Jan 24 07:12:10 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@1,0 (st242):
Jan 24 07:12:10 foo.com SCSI transport failed: reason 'timeout': giving
up
Jan 24 07:12:10 foo.com bptm[7122]: [ID 832037 daemon.error] scsi command
failed, may be timeout, scsi_pkt.us_reason = 4
Jan 24 07:19:20 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 07:19:20 foo.com SCSI transport failed: reason 'timeout': giving
up
Jan 24 07:19:20 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@1,0 (st242):
Jan 24 07:19:20 foo.com SCSI transport failed: reason 'reset': retrying
command
Jan 24 07:19:22 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@1,0 (st242):
Jan 24 07:19:22 foo.com Error for Command: rezero/rewind
Error Level: Fatal
Jan 24 07:19:22 foo.com scsi: [ID 107833 kern.notice] Requested Block: 1
Error Block: 1
Jan 24 07:19:22 foo.com scsi: [ID 107833 kern.notice] Vendor: SONY
Serial Number:
Jan 24 07:19:22 foo.com scsi: [ID 107833 kern.notice] Sense Key: Unit
Attention
Jan 24 07:19:22 foo.com scsi: [ID 107833 kern.notice] ASC: 0x29 (power on,
reset, or bus reset occurred), ASCQ: 0x0, FRU: 0x0
Jan 24 07:21:59 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@1,0 (st242):
Jan 24 07:21:59 foo.com SCSI transport failed: reason 'timeout': giving
up
Jan 24 07:26:28 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 07:26:28 foo.com SCSI transport failed: reason 'timeout': giving
up
Jan 24 07:26:28 foo.com bptm[8140]: [ID 832037 daemon.error] scsi command
failed, may be timeout, scsi_pkt.us_reason = 4
Jan 24 21:43:33 foo.com tl8d[6024]: [ID 666797 daemon.notice] Adding media ID
000169 to unmountable media list
Jan 24 21:43:33 foo.com tl8d[6024]: [ID 692047 daemon.error] TL8(0) drive 1
(device 0) is being DOWNED, status: Unable to open drive
Jan 24 21:43:33 foo.com tl8d[6024]: [ID 229259 daemon.error] Check integrity of
the drive, drive path, and media
Jan 24 21:43:33 foo.com tl8d[6024]: [ID 690496 daemon.notice] Removing media ID
000169 from unmountable media list
Jan 24 22:05:03 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 22:05:03 foo.com SCSI transport failed: reason 'timeout': giving
up
Jan 24 22:23:40 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 22:23:40 foo.com SCSI transport failed: reason 'timeout': giving
up
Jan 24 22:50:28 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 22:50:28 foo.com SCSI transport failed: reason 'timeout': giving
up
Jan 24 22:53:28 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4 (qus2):
Jan 24 22:53:28 foo.com Parity Error
Jan 24 22:55:20 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4 (qus2):
Jan 24 22:55:20 foo.com Parity Error
Jan 24 22:56:05 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4 (qus2):
Jan 24 22:56:05 foo.com Parity Error
Jan 24 22:58:05 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 22:58:05 foo.com SCSI transport failed: reason 'timeout': giving
up
Jan 24 23:08:15 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 23:08:15 foo.com SCSI transport failed: reason 'timeout': giving
up
Jan 24 23:15:44 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 23:15:44 foo.com SCSI transport failed: reason 'timeout': giving
up
Jan 24 23:20:50 foo.com scsi: [ID 107833 kern.warning] WARNING:
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 23:20:50 foo.com Error for Command: write
Error Level: Fatal
Jan 24 23:20:50 foo.com scsi: [ID 107833 kern.notice] Requested Block: 2
Error Block: 2
------------
Sorry for the long log..
The revision for the QLogic Ultra3 Scsi drives is 11.8.0,REV=2001.11.29.10.46.
Robot firmware:
Library: 2.60.0040
Drive: I couldn't check it.
RMU: 131A.00012
Any help/clue will very welcome.
Thanks a lot!
Francisco
|