Veritas-bu

[Veritas-bu] RE:Veritas-bu] End of Tape (from Nov. 2004)

2005-01-11 13:42:36
Subject: [Veritas-bu] RE:Veritas-bu] End of Tape (from Nov. 2004)
From: Scott.Chapman AT icbc DOT com (Chapman, Scott)
Date: Tue, 11 Jan 2005 10:42:36 -0800

Kathryn, are you running current firmware on the LTO2 drives?  I seem to
remember something about old firmware doing rewinds before NetBackup was
done with the drive . . .
From your logs:
01/10/2005 13:48:50 albus.ucdavis.edu albus.ucdavis.edu  FREEZING media id 040004, External event caused rewind during write, all data on media is lost

I am running IBM drives (we don't use the LSI Logic HBAs) and here is some
output from sgscan -v conf:
/dev/sg/c2t0l0: Tape (/dev/rmt/0): "IBM     ULTRIUM-TD2     38D0" : NOT-IN-ST-CONFIG-FILE
/dev/sg/c2t1l0: Tape (/dev/rmt/1): "IBM     ULTRIUM-TD2     38D0" : NOT-IN-ST-CONFIG-FILE
/dev/sg/c2t2l0: Tape (/dev/rmt/2): "IBM     ULTRIUM-TD2     38D0" : NOT-IN-ST-CONFIG-FILE
...
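
To compare firmware levels across drives, the quoted strings in the sgscan output can be split into fields. This is a minimal Python sketch, assuming each quoted string holds the SCSI inquiry vendor, product, and revision in that order; the regular expression and the function name parse_tape_drives are illustrative and based only on the lines quoted above, so other sgscan versions may need a different pattern.

```python
import re

# Pattern based only on the sgscan lines quoted in this thread; other
# sgscan versions may format their output differently.
TAPE_LINE = re.compile(
    r'(?P<sg>/dev/sg/\S+):\s+Tape\s+\((?P<rmt>[^)]+)\):\s+"(?P<inquiry>[^"]*)"'
)


def parse_tape_drives(sgscan_output):
    """Return one dict per tape drive line found in sgscan output."""
    drives = []
    for match in TAPE_LINE.finditer(sgscan_output):
        fields = match.group("inquiry").split()
        if len(fields) < 3:
            continue  # not the vendor/product/revision layout assumed here
        drives.append({
            "sg": match.group("sg"),
            "rmt": match.group("rmt"),
            "vendor": fields[0],
            "product": " ".join(fields[1:-1]),
            "firmware": fields[-1],  # assumed to be the inquiry revision field
        })
    return drives


SAMPLE = '''\
/dev/sg/c2t0l0: Tape (/dev/rmt/0): "IBM     ULTRIUM-TD2     38D0" :
NOT-IN-ST-CONFIG-FILE
/dev/sg/c2t1l0: Tape (/dev/rmt/1): "IBM     ULTRIUM-TD2     38D0" :
NOT-IN-ST-CONFIG-FILE
'''

if __name__ == "__main__":
    for drive in parse_tape_drives(SAMPLE):
        print(drive["rmt"], drive["vendor"], drive["product"], drive["firmware"])
```

The pattern stops at the closing quote, so it works whether or not the NOT-IN-ST-CONFIG-FILE suffix has wrapped onto its own line.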

I don't have anything in st.conf for the drives, as support for them was
added to the st driver several patches ago.  You might check your st patch
level as well . . .

Hope this helps.

Scott Chapman
ICBC - Victoria, Government St.
Phone: 250.414.7650  Cell: 250.213.9295



-----Original Message-----
From: Kathryn Hemness [mailto:kfhemness AT ucdavis DOT edu] 
Sent: Tuesday, January 11, 2005 10:02 AM
To: veritas-bu AT mailman.eng.auburn DOT edu
Cc: song_1977 AT yahoo DOT com
Subject: [Veritas-bu] RE:Veritas-bu] End of Tape (from Nov. 2004)


Good Morning --

Was there ever a resolution to your NB5.0MP2/LTO end of tape problem?

I'm currently fighting with a new installation of NB5.1 on a Solaris 9
system using LTO2 tape drives.  My backups ALWAYS fail either at a
checkpoint-restart WRITE or at the very last WRITE of the backup,
regardless of how big the backup is.

I've been told by my NetBackup tech support (via Sun) that it was a hardware
configuration problem.

The backups always fail, regardless of any st.conf modifications, and I've
even taken the fiber switch out of the mix.  Here's a summary of my
hardware and the types of errors I'm seeing (by the way, ufsdump works
just fine).

Master: Solaris 9 version 4/04 on a Sun V240 with 2 LSI Logic FC919X HBAs
running NB5.1 Enterprise Server.  One LSI Logic HBA is connected directly
to the fiber/SCSI bridge of a Qualstar 88264 LTO2 library, the other to a
Brocade 32-port fiber switch attached to a Sun 3511 storage array.

I have tried at least 4 different st.conf LTO2 configurations with the same
failing results and am now not using any special LTO2 definitions.

Here are the failure errors from both the NetBackup reports and from the
bptm logs:

01/10/2005 13:48:50 albus.ucdavis.edu albus.ucdavis.edu  FREEZING media id 040004, External event caused rewind during write, all data on media is lost
01/10/2005 13:48:54 albus.ucdavis.edu albus.ucdavis.edu  CLIENT albus.ucdavis.edu  POLICY IR-ISM_02  SCHED WeeklyFull  EXIT STATUS 84 (media write error)
01/10/2005 13:48:54 albus.ucdavis.edu albus.ucdavis.edu  backup of client albus.ucdavis.edu exited with status 84 (media write error)
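
When the same failure repeats across many jobs, it can help to pull the frozen media IDs and failing policies out of the report text. The following is a rough Python sketch; the patterns are inferred only from the three report lines above, and summarize_report is a hypothetical helper, not part of NetBackup.

```python
import re

# Both patterns are inferred from the report lines quoted in this post.
# \s+ is used between words so that mail-wrapped lines still match.
FROZEN = re.compile(r"FREEZING\s+media\s+id\s+(\S+?),")
EXIT_STATUS = re.compile(
    r"POLICY\s+(\S+)\s+SCHED\s+(\S+)\s+EXIT\s+STATUS\s+(\d+)"
)


def summarize_report(text):
    """Return (sorted frozen media ids, list of (policy, schedule, status))."""
    frozen = sorted(set(FROZEN.findall(text)))
    failures = [
        (policy, sched, int(status))
        for policy, sched, status in EXIT_STATUS.findall(text)
        if int(status) != 0  # status 0 means the job succeeded
    ]
    return frozen, failures


SAMPLE = """\
01/10/2005 13:48:50 albus.ucdavis.edu albus.ucdavis.edu  FREEZING media id
040004, External event caused rewind during write, all data on media is lost
01/10/2005 13:48:54 albus.ucdavis.edu albus.ucdavis.edu  CLIENT
albus.ucdavis.edu  POLICY IR-ISM_02  SCHED WeeklyFull  EXIT STATUS 84 (media
write error)
"""

if __name__ == "__main__":
    frozen, failures = summarize_report(SAMPLE)
    print("frozen media:", frozen)
    print("failed jobs:", failures)
```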

Here's the bptm log entry for the above error:

13:48:48.032 [1297] <2> write_backup: tp.tv_sec = 1105393728, stp.tv_sec = 1105391634, tp.tv_usec = 27455, stp.tv_usec = 544901, et = 2093483, mpx_total_kbytes[TWIN_INDEX = 0] = 21261376
13:48:48.075 [1297] <2> io_terminate_tape: writing empty backup header, drive index 0, copy 1
13:48:48.091 [1297] <2> io_ioctl: command (0)MTWEOF 1 from (bptm.c.7919) on drive index 0
13:48:48.645 [1297] <2> io_write_back_header: drive index 0, empty_file, file num = 2, mpx_headers = 0, copy 1
13:48:48.650 [1297] <2> io_close: closing /usr/openv/netbackup/db/media/tpreq/040004, from bptm.c.8046
13:48:50.848 [1297] <2> io_terminate_tape: absolute block position prior to writing empty header is 332201, copy 1
13:48:50.848 [1297] <2> io_terminate_tape: block position check: actual 332201, expected 332213
13:48:50.848 [1297] <2> set_job_details: Sending Tfile jobid (907)
13:48:50.848 [1297] <2> set_job_details: LOG 1105393730 16 bptm 1297 FREEZING media id 040004, External event caused rewind during write, all data on media is lost
13:48:50.848 [1297] <2> set_job_details: Done
13:48:50.880 [1297] <16> io_terminate_tape: FREEZING media id 040004, External event caused rewind during write, all data on media is lost
13:48:50.898 [1297] <2> log_media_error: successfully wrote to error file - 01/10/05 13:48:50 040004 0 WRITE_ERROR
13:48:50.910 [1297] <2> check_error_history: called from bptm line 17870, EXIT_Status = 84
13:48:50.911 [1297] <2> check_error_history: drive index = 0, media id = 040004, time = 01/10/05 13:48:50, both_match = 0, media_match = 0, drive_match = 0
13:48:50.911 [1297] <2> tpunmount: Check_for_waiting = 0, No_tpunmount_after_restore = 0, Media_Unmount_Delay = 0, MediaOffset = 4
13:48:50.911 [1297] <2> tpunmount: tpunmount'ing /usr/openv/netbackup/db/media/tpreq/040004
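
In the log above, the drive reports block 332201 while bptm expected 332213, a shortfall of 12 blocks, and that check is logged immediately before the FREEZING message. A small Python sketch like the one below could scan a bptm log for such mismatches; the pattern is taken only from the line format shown here, and position_mismatches is a hypothetical name.

```python
import re

# Pattern taken from the "block position check" line in this post.
# \s+ between words tolerates lines that were wrapped by a mail client.
POSITION_CHECK = re.compile(
    r"block\s+position\s+check:\s+actual\s+(\d+),\s+expected\s+(\d+)"
)


def position_mismatches(log_text):
    """Return (actual, expected, missing) for every check that disagrees."""
    results = []
    for actual, expected in POSITION_CHECK.findall(log_text):
        actual, expected = int(actual), int(expected)
        if actual != expected:
            results.append((actual, expected, expected - actual))
    return results


SAMPLE = """\
13:48:50.848 [1297] <2> io_terminate_tape: block position check: actual
332201, expected 332213
"""

if __name__ == "__main__":
    for actual, expected, missing in position_mismatches(SAMPLE):
        print(f"actual {actual}, expected {expected}, short by {missing} blocks")
```

If the shortfall turned out to be the same on every failed job, that would be worth mentioning to support, since it would hint at a fixed number of blocks going missing rather than a random rewind.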


Since ufsdump works, this points to a NetBackup 5.1 problem.  Anyway, I
notice that in your post-November posts you referred to NB4.5 servers.  Did
you have to downgrade NetBackup in order to get your LTO drives to work
properly?


--kathy

===============================================================================
Kathryn Hemness                        kfhemness AT ucdavis DOT edu
System Administrator                   phone: 530.752.6547
Campus Data Center & Client Services   fax:   530.752.9154
_______________________________________________
Veritas-bu maillist  -  Veritas-bu AT mailman.eng.auburn DOT edu
http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu
