Veritas-bu

[Veritas-bu] Robot Drives downed

2005-01-31 11:53:53
Subject: [Veritas-bu] Robot Drives downed
From: fpuente AT itconvergence DOT com (Francisco Puente)
Date: Mon, 31 Jan 2005 08:53:53 -0800
Hi all,
I'm sure this is the place for asking this :)
 
The configuration:
Hardware: SunFire 6800
Robot: ADIC Scalar 100 with two drives, 50 slots connected via SCSI.
OS: Solaris 8
Storage: Clariion CX600
Netbackup 4.5GA

Everytime the backup starts, I get this errors and then both drives go DOWN, no 
more backups are possible.
--
Jan 24 06:47:24 foo.com tl8d[6024]: [ID 182334 daemon.error] TL8(0) going to UP 
state
Jan 24 07:07:39 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 07:07:39 foo.com         SCSI transport failed: reason 'timeout': giving 
up
Jan 24 07:07:39 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@1,0 (st242):
Jan 24 07:07:39 foo.com         SCSI transport failed: reason 'reset': retrying 
command
Jan 24 07:07:42 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@1,0 (st242):
Jan 24 07:07:42 foo.com         Error for Command: rezero/rewind           
Error Level: Fatal
Jan 24 07:07:42 foo.com scsi: [ID 107833 kern.notice]   Requested Block: 1      
                   Error Block: 1
Jan 24 07:07:42 foo.com scsi: [ID 107833 kern.notice]   Vendor: SONY            
                   Serial Number:
Jan 24 07:07:42 foo.com scsi: [ID 107833 kern.notice]   Sense Key: Unit 
Attention
Jan 24 07:07:42 foo.com scsi: [ID 107833 kern.notice]   ASC: 0x29 (power on, 
reset, or bus reset occurred), ASCQ: 0x0, FRU: 0x0
Jan 24 07:12:10 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@1,0 (st242):
Jan 24 07:12:10 foo.com         SCSI transport failed: reason 'timeout': giving 
up
Jan 24 07:12:10 foo.com bptm[7122]: [ID 832037 daemon.error] scsi command 
failed, may be timeout, scsi_pkt.us_reason = 4
Jan 24 07:19:20 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 07:19:20 foo.com         SCSI transport failed: reason 'timeout': giving 
up
Jan 24 07:19:20 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@1,0 (st242):
Jan 24 07:19:20 foo.com         SCSI transport failed: reason 'reset': retrying 
command
Jan 24 07:19:22 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@1,0 (st242):
Jan 24 07:19:22 foo.com         Error for Command: rezero/rewind           
Error Level: Fatal
Jan 24 07:19:22 foo.com scsi: [ID 107833 kern.notice]   Requested Block: 1      
                   Error Block: 1
Jan 24 07:19:22 foo.com scsi: [ID 107833 kern.notice]   Vendor: SONY            
                   Serial Number:
Jan 24 07:19:22 foo.com scsi: [ID 107833 kern.notice]   Sense Key: Unit 
Attention
Jan 24 07:19:22 foo.com scsi: [ID 107833 kern.notice]   ASC: 0x29 (power on, 
reset, or bus reset occurred), ASCQ: 0x0, FRU: 0x0
Jan 24 07:21:59 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@1,0 (st242):
Jan 24 07:21:59 foo.com         SCSI transport failed: reason 'timeout': giving 
up
Jan 24 07:26:28 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 07:26:28 foo.com         SCSI transport failed: reason 'timeout': giving 
up
Jan 24 07:26:28 foo.com bptm[8140]: [ID 832037 daemon.error] scsi command 
failed, may be timeout, scsi_pkt.us_reason = 4
Jan 24 21:43:33 foo.com tl8d[6024]: [ID 666797 daemon.notice] Adding media ID 
000169 to unmountable media list
Jan 24 21:43:33 foo.com tl8d[6024]: [ID 692047 daemon.error] TL8(0) drive 1 
(device 0) is being DOWNED, status: Unable to open drive
Jan 24 21:43:33 foo.com tl8d[6024]: [ID 229259 daemon.error] Check integrity of 
the drive, drive path, and media
Jan 24 21:43:33 foo.com tl8d[6024]: [ID 690496 daemon.notice] Removing media ID 
000169 from unmountable media list
Jan 24 22:05:03 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 22:05:03 foo.com         SCSI transport failed: reason 'timeout': giving 
up
Jan 24 22:23:40 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 22:23:40 foo.com         SCSI transport failed: reason 'timeout': giving 
up
Jan 24 22:50:28 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 22:50:28 foo.com         SCSI transport failed: reason 'timeout': giving 
up
Jan 24 22:53:28 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4 (qus2):
Jan 24 22:53:28 foo.com         Parity Error
Jan 24 22:55:20 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4 (qus2):
Jan 24 22:55:20 foo.com         Parity Error
Jan 24 22:56:05 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4 (qus2):
Jan 24 22:56:05 foo.com         Parity Error
Jan 24 22:58:05 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 22:58:05 foo.com         SCSI transport failed: reason 'timeout': giving 
up
Jan 24 23:08:15 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 23:08:15 foo.com         SCSI transport failed: reason 'timeout': giving 
up
Jan 24 23:15:44 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 23:15:44 foo.com         SCSI transport failed: reason 'timeout': giving 
up
Jan 24 23:20:50 foo.com scsi: [ID 107833 kern.warning] WARNING: 
/ssm@0,0/pci@1a,600000/pci@1/scsi@4/st@2,0 (st244):
Jan 24 23:20:50 foo.com         Error for Command: write                   
Error Level: Fatal
Jan 24 23:20:50 foo.com scsi: [ID 107833 kern.notice]   Requested Block: 2      
                   Error Block: 2
------------
Sorry for the long log..
The revision for the QLogic Ultra3 Scsi drives is 11.8.0,REV=2001.11.29.10.46.
Robot firmware:
Library: 2.60.0040
Drive: I couldn't check it.
RMU: 131A.00012

Any help/clue will very welcome.
Thanks a lot!

Francisco


<Prev in Thread] Current Thread [Next in Thread>
  • [Veritas-bu] Robot Drives downed, Francisco Puente <=