Veritas-bu

[Veritas-bu] Drives unusable

2005-05-05 12:37:41
Subject: [Veritas-bu] Drives unusable
From: SKampen AT verisign DOT com (Kampen, Scott)
Date: Thu, 5 May 2005 09:37:41 -0700
This is a multi-part message in MIME format.

------_=_NextPart_001_01C55190.C1D6BC8D
Content-Type: text/plain;
        charset="us-ascii"
Content-Transfer-Encoding: quoted-printable

Hello group,
=20
I'm running NB 5.0 on a Sun 480R - Solaris 9 with two 280R's as media
servers.  My tape unit is an IBM with 12 LTO fiber attached tape drives.
Here's the problem:
=20
Daily my servers (both the master and the two media servers) will lose
visibility to some of the tape drives.  I know this by running cfgadm
-al and it shows something like the following:
=20
c3 fc-fabric connected configured unknown
c3::500507630f404301 tape connected configured unknown
c3::500507630f404302 tape connected configured unknown
c3::500507630f404303 tape connected configured unusable
c3::500507630f404304 tape connected configured unusable
c3::500507630f404305 tape connected configured unknown
c3::500507630f404306 tape connected configured unknown
c3::50060482cafd8e8c disk connected configured unknown
c4 fc-fabric connected configured unknown
c4::500507630f404307 tape connected configured unknown
c4::500507630f404308 tape connected configured unknown
c4::500507630f404309 tape connected configured unknown
c4::500507630f40430a tape connected configured unknown
c4::500507630f40430b tape connected configured unknown
c4::500507630f40430c tape connected configured unknown
c4::50060482cafd8e83 disk connected configured unknown

Notice the two drives that show "unusable".  Now to fix this I've been
running the following command.

cfgadm -c configure c3

In my message log I notice reference to

May  5 08:03:20 pong    transport rejected
May  5 08:03:20 pong genunix: [ID 408114 kern.info]
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404304,0 (st17) offline
May  5 08:03:20 pong scsi: [ID 107833 kern.warning] WARNING:
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404303,0 (st16):
May  5 08:03:20 pong    transport rejected
May  5 08:03:20 pong genunix: [ID 408114 kern.info]
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404303,0 (st16) offline
May  5 08:06:20 pong bptm[3279]: [ID 498531 daemon.error] user scsi
ioctl() failed, may be timeout, errno =3D 2, Error 0 May  5 08:06:20 =
pong
bptm[13159]: [ID 498531 daemon.error] user scsi ioctl() failed, may be
timeout, errno =3D 2, Error 0 May  5 08:06:20 pong bptm[3279]: [ID =
498531
daemon.error] user scsi ioctl() failed, may be timeout, errno =3D 2, No
such file or directory May  5 08:06:20 pong bptm[13159]: [ID 498531
daemon.error] user scsi ioctl() failed, may be timeout, errno =3D 2, No
such file or directory May  5 08:06:49 pong scsi: [ID 365881 kern.info]
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404303,0 (st16):
May  5 08:06:49 pong    <IBM Ultrium Gen 2 LTO>
May  5 08:06:49 pong scsi: [ID 799468 kern.info] st16 at fp1: name
w500507630f404303,0, bus address 61300 May  5 08:06:49 pong genunix: [ID
936769 kern.info] st16 is
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404303,0
May  5 08:06:49 pong genunix: [ID 408114 kern.info]
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404303,0 (st16) online May
5 08:06:51 pong scsi: [ID 365881 kern.info]
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404301,0 (st14):
May  5 08:06:51 pong    <IBM Ultrium Gen 2 LTO>=20



Does this problem have to do with a NetBackup timeout value or is this a
SUN issue?  I've got SUN support on these systems and will work the
issue with them if needed, but I didn't know if someone else on this
group might have run into the same issue.  Maybe there is a kernel
setting that needs tweaking?

Thanks for any help in advance.
Scott

------_=_NextPart_001_01C55190.C1D6BC8D
Content-Type: text/html;
        charset="us-ascii"
Content-Transfer-Encoding: quoted-printable

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2//EN">
<HTML>
<HEAD>
<META HTTP-EQUIV=3D"Content-Type" CONTENT=3D"text/html; =
charset=3Dus-ascii">
<META NAME=3D"Generator" CONTENT=3D"MS Exchange Server version =
6.5.7232.25">
<TITLE>Drives unusable</TITLE>
</HEAD>
<BODY>
<!-- Converted from text/rtf format -->

<P><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">Hello =
group,</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">&nbsp;</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">I'm running =
NB 5.0 on a Sun 480R - Solaris 9 with two 280R's as media servers.&nbsp; =
My tape unit is an IBM with 12 LTO fiber attached tape drives.&nbsp; =
Here's the problem:</FONT></SPAN></P>

<P><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">&nbsp;</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">Daily my =
servers (both the master and the two media servers) will lose visibility =
to some of the tape drives.&nbsp; I know this by running cfgadm -al and =
it shows something like the following:</FONT></SPAN></P>

<P><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">&nbsp;</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">c3 =
fc-fabric connected configured unknown</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">c3::500507630f404301 tape connected configured =
unknown</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">c3::500507630f404302 tape connected configured =
unknown</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">c3::500507630f404303 tape connected configured<B> =
unusable</B></FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">c3::500507630f404304 tape connected configured</FONT><B> <FONT =
SIZE=3D2 FACE=3D"Courier New">unusable</FONT></B></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">c3::500507630f404305 tape connected configured =
unknown</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">c3::500507630f404306 tape connected configured =
unknown</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">c3::50060482cafd8e8c disk connected configured =
unknown</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">c4 =
fc-fabric connected configured unknown</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">c4::500507630f404307 tape connected configured =
unknown</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">c4::500507630f404308 tape connected configured =
unknown</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">c4::500507630f404309 tape connected configured =
unknown</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">c4::500507630f40430a tape connected configured =
unknown</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">c4::500507630f40430b tape connected configured =
unknown</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">c4::500507630f40430c tape connected configured =
unknown</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">c4::50060482cafd8e83 disk connected configured =
unknown</FONT></SPAN>
</P>

<P><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">Notice the =
two drives that show &quot;unusable&quot;.&nbsp; Now to fix this I've =
been running the following command.</FONT></SPAN>
</P>

<P><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">cfgadm -c =
configure c3</FONT></SPAN>
</P>

<P><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">In my =
message log I notice reference to</FONT></SPAN>
</P>

<P><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">May&nbsp; 5 =
08:03:20 pong &nbsp;&nbsp; transport rejected</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">May&nbsp; 5 =
08:03:20 pong genunix: [ID 408114 kern.info] =
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404304,0 (st17) offline =
May&nbsp; 5 08:03:20 pong scsi: [ID 107833 kern.warning] WARNING: =
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404303,0 =
(st16):</FONT></SPAN></P>

<P><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">May&nbsp; 5 =
08:03:20 pong &nbsp;&nbsp; transport rejected</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">May&nbsp; 5 =
08:03:20 pong genunix: [ID 408114 kern.info] =
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404303,0 (st16) offline =
May&nbsp; 5 08:06:20 pong bptm[3279]: [ID 498531 daemon.error]</FONT><B> =
<FONT SIZE=3D2 FACE=3D"Courier New">user scsi ioctl() failed, may be =
timeout, errno =3D 2,</FONT></B><FONT SIZE=3D2 FACE=3D"Courier New"> =
Error 0 May&nbsp; 5 08:06:20 pong bptm[13159]: [ID 498531 daemon.error] =
user scsi ioctl() failed, may be timeout, errno =3D 2, Error 0 May&nbsp; =
5 08:06:20 pong bptm[3279]: [ID 498531 daemon.error] user scsi ioctl() =
failed, may be timeout, errno =3D 2, No such file or directory May&nbsp; =
5 08:06:20 pong bptm[13159]: [ID 498531 daemon.error] user scsi ioctl() =
failed, may be timeout, errno =3D 2, No such file or directory May&nbsp; =
5 08:06:49 pong scsi: [ID 365881 kern.info] =
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404303,0 =
(st16):</FONT></SPAN></P>

<P><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">May&nbsp; 5 =
08:06:49 pong &nbsp;&nbsp; &lt;IBM Ultrium Gen 2 LTO&gt;</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">May&nbsp; 5 =
08:06:49 pong scsi: [ID 799468 kern.info] st16 at fp1: name =
w500507630f404303,0, bus address 61300 May&nbsp; 5 08:06:49 pong =
genunix: [ID 936769 kern.info] st16 is =
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404303,0</FONT></SPAN></P>

<P><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">May&nbsp; 5 =
08:06:49 pong genunix: [ID 408114 kern.info] =
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404303,0 (st16) online =
May&nbsp; 5 08:06:51 pong scsi: [ID 365881 kern.info] =
/pci@8,700000/SUNW,qlc@3/fp@0,0/st@w500507630f404301,0 =
(st14):</FONT></SPAN></P>

<P><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">May&nbsp; 5 =
08:06:51 pong &nbsp;&nbsp; &lt;IBM Ultrium Gen 2 LTO&gt; </FONT></SPAN>
</P>
<BR>
<BR>

<P><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">Does this =
problem have to do with a NetBackup timeout value or is this a SUN =
issue?&nbsp; I've got SUN support on these systems and will work the =
issue with them if needed, but I didn't know if someone else on this =
group might have run into the same issue.&nbsp; Maybe there is a kernel =
setting that needs tweaking?</FONT></SPAN></P>

<P><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier New">Thanks for =
any help in advance.</FONT></SPAN>

<BR><SPAN LANG=3D"en-us"><FONT SIZE=3D2 FACE=3D"Courier =
New">Scott</FONT></SPAN>
</P>

</BODY>
</HTML>
------_=_NextPart_001_01C55190.C1D6BC8D--

<Prev in Thread] Current Thread [Next in Thread>