ADSM-L

Re: ANR8302E: How do I test new drive?

2003-12-03 01:54:53
Subject: Re: ANR8302E: How do I test new drive?
From: Tobias Hofmann <tobias.hofmann AT MEDIEN.UNI-WEIMAR DOT DE>
To: ADSM-L AT VM.MARIST DOT EDU
Date: Wed, 3 Dec 2003 07:54:18 +0100
Jack,

On 01.12.2003 20:22, Coats, Jack wrote:

Just a thought, take TSM server down.

Did so yesterday...

Use an OS native program (Microsoft Backup) using a hand fed scratch tape
or two to backup the local system and restore a few files until you believe
the
drive will hold its own.

...and failed to get it to work with ntbackup - several problems from
"too many tapes in library" to refusal to write to the tape in the
drive. grrr.

I see your point (and thanks for the input), but does anyone have
another approach?

It would be very much appreciated...

TIA, greets, tobi...

Then start with TSM.  If the same tapes are read and written OK in other
drives, then you still have a drive problem, IMHO.  (Or cable, or SCSI
controller).
If all the errors are on the same drive, you may still have a problem.

LOL ... JC


-----Original Message-----
From: Tobias Hofmann [SMTP:tobias.hofmann AT MEDIEN.UNI-WEIMAR DOT DE]
Sent: Monday, December 01, 2003 11:19 AM
To:   ADSM-L AT VM.MARIST DOT EDU
Subject:      ANR8302E: How do I test new drive?

Ladies, gentlemen,

a new, replaced LTO drive gives me headaches - in the following situation:

OS: MS Win2k Advanced Server, 5.00.2195, SP2
TSM: Storage Management Server for Windows - Version 5, Release 1, Level
0.0

After having worked with no probs afaik, last week one of my two HP
ultrium LTO drives died and was replaced by dell (guarantee/bronce
contract). Following this, i wittnessed for two days the cumbersome
process of firmware-updating the library (PV136T, equals a ADIC Scalar
100, or so I am told), after which the system seemed ok. I deleted drive
and path definitions and hoped for the best. This is what I have seen
two days later:

11/27/03   15:19:30      ANR0984I Process 22 for MIGRATION started in
the                          BACKGROUND at 15:19:30.
11/27/03   15:19:30      ANR1000I Migration process 22 started for
storage pool                          DISKPOOL.
11/27/03   15:20:17      ANR8337I LTO volume 000002L1 mounted in drive
MT2.0.0.3                          (mt2.0.0.3).
11/27/03   15:46:35      ANR8337I LTO volume 000005L1 mounted in drive
MT1.0.0.3                          (mt1.0.0.3).
11/27/03   15:46:38      ANR1340I Scratch volume 000005L1 is now defined
in storage                          pool LTOPOOL1.
11/27/03   15:49:53      ANR8302E I/O error on drive MT1.0.0.3
(mt1.0.0.3)                          (OP=WRITE, Error Number=1117,
CC=306, KEY=03, ASC=0C,                          ASCQ=00,
SENSE=71.00.03.00.00.00.00.0E.00.00.00.00.0C.00
.00.00.75.0B.00.00.00.00., Description=Drive or media          failure).
 Refer to Appendix D in the 'Messages' manual                    for
recommended action.
11/27/03   15:49:53      ANR8359E Media fault detected on LTO volume
000005L1 in                          drive MT1.0.0.3 (mt1.0.0.3) of
library LB0.0.0.3.
11/27/03   15:49:53      ANR1411W Access mode for volume 000005L1 now
set to                          "read-only" due to write error.
11/27/03   15:49:53      ANR0523W Transaction failed for session 1837
for node CMS1                          (WinNT) - error on output storage
device.
11/27/03   15:49:59      ANR1341I Scratch volume 000005L1 has been
deleted from                          storage pool LTOPOOL1.
11/27/03   15:50:49      ANR8468I LTO volume 000005L1 dismounted from
drive                          MT1.0.0.3 (mt1.0.0.3) in library LB0.0.0.3.
11/27/03   15:51:41      ANR8337I LTO volume 000009L1 mounted in drive
MT1.0.0.3                          (mt1.0.0.3).
11/27/03   15:51:45      ANR1340I Scratch volume 000009L1 is now defined
in storage                          pool LTOPOOL1.
11/27/03   15:53:40      ANR8302E I/O error on drive MT1.0.0.3
(mt1.0.0.3)                          (OP=WRITE, Error Number=1117,
CC=306, KEY=03, ASC=0C,                          ASCQ=00,
SENSE=71.00.03.00.00.00.00.0E.00.00.00.00.0C.00
.00.00.75.0B.00.00.00.00., Description=Drive or media          failure).
 Refer to Appendix D in the 'Messages' manual                    for
recommended action.
11/27/03   15:53:40      ANR8359E Media fault detected on LTO volume
000009L1 in                          drive MT1.0.0.3 (mt1.0.0.3) of
library LB0.0.0.3.
11/27/03   15:53:40      ANR1411W Access mode for volume 000009L1 now
set to                          "read-only" due to write error.
11/27/03   15:53:40      ANR0523W Transaction failed for session 1837
for node CMS1                          (WinNT) - error on output storage
device.
11/27/03   15:53:46      ANR1341I Scratch volume 000009L1 has been
deleted from                          storage pool LTOPOOL1.
11/27/03   15:54:16      ANR8468I LTO volume 000009L1 dismounted from
drive                          MT1.0.0.3 (mt1.0.0.3) in library LB0.0.0.3.
11/27/03   15:55:06      ANR8337I LTO volume 000010L1 mounted in drive
MT1.0.0.3                          (mt1.0.0.3).
11/27/03   15:55:11      ANR1340I Scratch volume 000010L1 is now defined
in storage                          pool LTOPOOL1.
11/27/03   15:56:47      ANR8302E I/O error on drive MT1.0.0.3
(mt1.0.0.3)                          (OP=WEOF, Error Number=1117,
CC=306, KEY=03, ASC=0C,                          ASCQ=00,
SENSE=70.00.03.00.00.00.00.0E.00.00.00.00.0C.00
.00.00.75.0B.00.00.00.00., Description=Drive or media          failure).
 Refer to Appendix D in the 'Messages' manual                    for
recommended action.
11/27/03   15:56:47      ANR8359E Media fault detected on LTO volume
000010L1 in                          drive MT1.0.0.3 (mt1.0.0.3) of
library LB0.0.0.3.
11/27/03   15:56:47      ANR1401W Mount request denied for volume
000010L1 - mount                          failed.
11/27/03   15:57:18      ANR8468I LTO volume 000010L1 dismounted from
drive                          MT1.0.0.3 (mt1.0.0.3) in library LB0.0.0.3.
11/27/03   16:00:04      ANR8302E I/O error on drive MT1.0.0.3
(mt1.0.0.3)                          (OP=READ, Error Number=1117,
CC=306, KEY=03, ASC=14,                          ASCQ=00,
SENSE=F0.00.03.00.00.00.50.0E.00.00.00.00.14.00
.00.00.50.8F.00.00.00.00., Description=Drive or media          failure).
 Refer to Appendix D in the 'Messages' manual                    for
recommended action.
11/27/03   16:00:04      ANR8355E I/O error reading label for volume
000010L1 in                          drive MT1.0.0.3 (mt1.0.0.3).
11/27/03   16:00:11      ANR0482W Session 1836 for node CMS1 (WinNT)
terminated -                          idle for more than 15 minutes.
11/27/03   16:00:35      ANR8381E LTO volume 000010L1  could not be
mounted in                          drive MT1.0.0.3 (mt1.0.0.3).
11/27/03   16:00:35      ANR1402W Mount request denied for volume
000010L1 - volume                          unavailable.
11/27/03   16:00:35      ANR1410W Access mode for volume 000010L1 now
set to                          "unavailable".
...

This is taking up my already few scratch tapes too fast...

(By the way: What is the correct way to turn these tapes into scratch
tapes again? move data into the storagepool, and then manually turn them
to scratch-status again? or do i miss something there?)

Tests done with the Dell-provided software tools have not given any hint
on problems occuring in the library. I have now been advised to swap the
two drives (one becomes two and vice versa) and see if the problem
persists - but my problem now is that I would not know how to do
reasonable testing.

I don,t know how to trigger migration process manually (searched the
admin ref pdf to no avail),
but even if I did, there is only so many migrations I can do, and then I
am stumped.

I don,t know how to advise TSM to move data using a defined drive - say,
from 1 to 2, and not from 2 to one - how can this be done?

I am quite limited in space and don,t have copy pools, so I am a bit
hesitant to move data with a possibly faulty drive and breaking
something - is there a clever approach doing a copy onto a single tape
volume? Again, I checked the guide but did not see anything wrt this...

Any input would be very much appreciated,

greets, tobi

--
----------------------------------------------------------------------
Dipl.-Ing. Tobias Hofmann   Bauhaus-Universitaet Weimar  D99423 Weimar
Professur fuer Graphische Datenverarbeitung      Projekt medienquadrat
SnailMail:  Bauhaus-Universitaet  Weimar,  Fak. Medien,  D99421 Weimar
Location:     D99423 Weimar     Karl-Haussknechtstr. 7      Zimmer 111
                Fon: ++49-(0)3643-58-3780  Fax : -3701
          e-mail: mailto:tobias.hofmann AT medien.uni-weimar DOT de
----------------------------------------------------------------------



--
----------------------------------------------------------------------
Dipl.-Ing. Tobias Hofmann   Bauhaus-Universitaet Weimar  D99423 Weimar
Professur fuer Graphische Datenverarbeitung      Projekt medienquadrat
SnailMail:  Bauhaus-Universitaet  Weimar,  Fak. Medien,  D99421 Weimar
Location:     D99423 Weimar     Karl-Haussknechtstr. 7      Zimmer 111
                Fon: ++49-(0)3643-58-3780  Fax : -3701
          e-mail: mailto:tobias.hofmann AT medien.uni-weimar DOT de
----------------------------------------------------------------------

<Prev in Thread] Current Thread [Next in Thread>