Try setting "persistent binding" on your HBA's.
Bobby Williams
Sr. Systems Engineer
4250-A Benton Drive
Chattanooga, Tennessee 37406
Pager: 423-819-3336
Phone: 423-624-2600 x122
________________________________
From: veritas-bu-admin AT mailman.eng.auburn DOT edu on behalf of Kampen, Scott
Sent: Thu 5/5/2005 12:37 PM
To: veritas-bu AT mailman.eng.auburn DOT edu
Subject: [Veritas-bu] Drives unusable
Hello group,
I'm running NB 5.0 on a Sun 480R - Solaris 9 with two 280R's as media servers.
My tape unit is an IBM with 12 LTO fiber attached tape drives. Here's the
problem:
Daily my servers (both the master and the two media servers) will lose
visibility to some of the tape drives. I know this by running cfgadm -al and
it shows something like the following:
c3 fc-fabric connected configured unknown
c3::500507630f404301 tape connected configured unknown
c3::500507630f404302 tape connected configured unknown
c3::500507630f404303 tape connected configured unusable
c3::500507630f404304 tape connected configured unusable
c3::500507630f404305 tape connected configured unknown
c3::500507630f404306 tape connected configured unknown
c3::50060482cafd8e8c disk connected configured unknown
c4 fc-fabric connected configured unknown
c4::500507630f404307 tape connected configured unknown
c4::500507630f404308 tape connected configured unknown
c4::500507630f404309 tape connected configured unknown
c4::500507630f40430a tape connected configured unknown
c4::500507630f40430b tape connected configured unknown
c4::500507630f40430c tape connected configured unknown
c4::50060482cafd8e83 disk connected configured unknown
Notice the two drives that show "unusable". Now to fix this I've been running
the following command.
cfgadm -c configure c3
In my message log I notice reference to
May 5 08:03:20 pong transport rejected
May 5 08:03:20 pong genunix: [ID 408114 kern.info]
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404304,0 (st17) offline May 5
08:03:20 pong scsi: [ID 107833 kern.warning] WARNING:
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404303,0 (st16):
May 5 08:03:20 pong transport rejected
May 5 08:03:20 pong genunix: [ID 408114 kern.info]
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404303,0 (st16) offline May 5
08:06:20 pong bptm[3279]: [ID 498531 daemon.error] user scsi ioctl() failed,
may be timeout, errno = 2, Error 0 May 5 08:06:20 pong bptm[13159]: [ID 498531
daemon.error] user scsi ioctl() failed, may be timeout, errno = 2, Error 0 May
5 08:06:20 pong bptm[3279]: [ID 498531 daemon.error] user scsi ioctl() failed,
may be timeout, errno = 2, No such file or directory May 5 08:06:20 pong
bptm[13159]: [ID 498531 daemon.error] user scsi ioctl() failed, may be timeout,
errno = 2, No such file or directory May 5 08:06:49 pong scsi: [ID 365881
kern.info] /pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404303,0 (st16):
May 5 08:06:49 pong <IBM Ultrium Gen 2 LTO>
May 5 08:06:49 pong scsi: [ID 799468 kern.info] st16 at fp1: name
w500507630f404303,0, bus address 61300 May 5 08:06:49 pong genunix: [ID 936769
kern.info] st16 is /pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404303,0
May 5 08:06:49 pong genunix: [ID 408114 kern.info]
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404303,0 (st16) online May 5
08:06:51 pong scsi: [ID 365881 kern.info]
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404301,0 (st14):
May 5 08:06:51 pong <IBM Ultrium Gen 2 LTO>
Does this problem have to do with a NetBackup timeout value or is this a SUN
issue? I've got SUN support on these systems and will work the issue with them
if needed, but I didn't know if someone else on this group might have run into
the same issue. Maybe there is a kernel setting that needs tweaking?
Thanks for any help in advance.
Scott
|