ADSM-L

ANR8302E: How do I test new drive?

2003-12-01 12:19:39
Subject: ANR8302E: How do I test new drive?
From: Tobias Hofmann <tobias.hofmann AT MEDIEN.UNI-WEIMAR DOT DE>
To: ADSM-L AT VM.MARIST DOT EDU
Date: Mon, 1 Dec 2003 18:18:39 +0100
Ladies, gentlemen,

a new, replaced LTO drive gives me headaches - in the following situation:

OS: MS Win2k Advanced Server, 5.00.2195, SP2
TSM: Storage Management Server for Windows - Version 5, Release 1, Level 0.0

After having worked with no probs afaik, last week one of my two HP
ultrium LTO drives died and was replaced by dell (guarantee/bronce
contract). Following this, i wittnessed for two days the cumbersome
process of firmware-updating the library (PV136T, equals a ADIC Scalar
100, or so I am told), after which the system seemed ok. I deleted drive
and path definitions and hoped for the best. This is what I have seen
two days later:

11/27/03   15:19:30      ANR0984I Process 22 for MIGRATION started in
the                          BACKGROUND at 15:19:30.
11/27/03   15:19:30      ANR1000I Migration process 22 started for
storage pool                          DISKPOOL.
11/27/03   15:20:17      ANR8337I LTO volume 000002L1 mounted in drive
MT2.0.0.3                          (mt2.0.0.3).
11/27/03   15:46:35      ANR8337I LTO volume 000005L1 mounted in drive
MT1.0.0.3                          (mt1.0.0.3).
11/27/03   15:46:38      ANR1340I Scratch volume 000005L1 is now defined
in storage                          pool LTOPOOL1.
11/27/03   15:49:53      ANR8302E I/O error on drive MT1.0.0.3
(mt1.0.0.3)                          (OP=WRITE, Error Number=1117,
CC=306, KEY=03, ASC=0C,                          ASCQ=00,
SENSE=71.00.03.00.00.00.00.0E.00.00.00.00.0C.00
.00.00.75.0B.00.00.00.00., Description=Drive or media          failure).
 Refer to Appendix D in the 'Messages' manual                    for
recommended action.
11/27/03   15:49:53      ANR8359E Media fault detected on LTO volume
000005L1 in                          drive MT1.0.0.3 (mt1.0.0.3) of
library LB0.0.0.3.
11/27/03   15:49:53      ANR1411W Access mode for volume 000005L1 now
set to                          "read-only" due to write error.
11/27/03   15:49:53      ANR0523W Transaction failed for session 1837
for node CMS1                          (WinNT) - error on output storage
device.
11/27/03   15:49:59      ANR1341I Scratch volume 000005L1 has been
deleted from                          storage pool LTOPOOL1.
11/27/03   15:50:49      ANR8468I LTO volume 000005L1 dismounted from
drive                          MT1.0.0.3 (mt1.0.0.3) in library LB0.0.0.3.
11/27/03   15:51:41      ANR8337I LTO volume 000009L1 mounted in drive
MT1.0.0.3                          (mt1.0.0.3).
11/27/03   15:51:45      ANR1340I Scratch volume 000009L1 is now defined
in storage                          pool LTOPOOL1.
11/27/03   15:53:40      ANR8302E I/O error on drive MT1.0.0.3
(mt1.0.0.3)                          (OP=WRITE, Error Number=1117,
CC=306, KEY=03, ASC=0C,                          ASCQ=00,
SENSE=71.00.03.00.00.00.00.0E.00.00.00.00.0C.00
.00.00.75.0B.00.00.00.00., Description=Drive or media          failure).
 Refer to Appendix D in the 'Messages' manual                    for
recommended action.
11/27/03   15:53:40      ANR8359E Media fault detected on LTO volume
000009L1 in                          drive MT1.0.0.3 (mt1.0.0.3) of
library LB0.0.0.3.
11/27/03   15:53:40      ANR1411W Access mode for volume 000009L1 now
set to                          "read-only" due to write error.
11/27/03   15:53:40      ANR0523W Transaction failed for session 1837
for node CMS1                          (WinNT) - error on output storage
device.
11/27/03   15:53:46      ANR1341I Scratch volume 000009L1 has been
deleted from                          storage pool LTOPOOL1.
11/27/03   15:54:16      ANR8468I LTO volume 000009L1 dismounted from
drive                          MT1.0.0.3 (mt1.0.0.3) in library LB0.0.0.3.
11/27/03   15:55:06      ANR8337I LTO volume 000010L1 mounted in drive
MT1.0.0.3                          (mt1.0.0.3).
11/27/03   15:55:11      ANR1340I Scratch volume 000010L1 is now defined
in storage                          pool LTOPOOL1.
11/27/03   15:56:47      ANR8302E I/O error on drive MT1.0.0.3
(mt1.0.0.3)                          (OP=WEOF, Error Number=1117,
CC=306, KEY=03, ASC=0C,                          ASCQ=00,
SENSE=70.00.03.00.00.00.00.0E.00.00.00.00.0C.00
.00.00.75.0B.00.00.00.00., Description=Drive or media          failure).
 Refer to Appendix D in the 'Messages' manual                    for
recommended action.
11/27/03   15:56:47      ANR8359E Media fault detected on LTO volume
000010L1 in                          drive MT1.0.0.3 (mt1.0.0.3) of
library LB0.0.0.3.
11/27/03   15:56:47      ANR1401W Mount request denied for volume
000010L1 - mount                          failed.
11/27/03   15:57:18      ANR8468I LTO volume 000010L1 dismounted from
drive                          MT1.0.0.3 (mt1.0.0.3) in library LB0.0.0.3.
11/27/03   16:00:04      ANR8302E I/O error on drive MT1.0.0.3
(mt1.0.0.3)                          (OP=READ, Error Number=1117,
CC=306, KEY=03, ASC=14,                          ASCQ=00,
SENSE=F0.00.03.00.00.00.50.0E.00.00.00.00.14.00
.00.00.50.8F.00.00.00.00., Description=Drive or media          failure).
 Refer to Appendix D in the 'Messages' manual                    for
recommended action.
11/27/03   16:00:04      ANR8355E I/O error reading label for volume
000010L1 in                          drive MT1.0.0.3 (mt1.0.0.3).
11/27/03   16:00:11      ANR0482W Session 1836 for node CMS1 (WinNT)
terminated -                          idle for more than 15 minutes.
11/27/03   16:00:35      ANR8381E LTO volume 000010L1  could not be
mounted in                          drive MT1.0.0.3 (mt1.0.0.3).
11/27/03   16:00:35      ANR1402W Mount request denied for volume
000010L1 - volume                          unavailable.
11/27/03   16:00:35      ANR1410W Access mode for volume 000010L1 now
set to                          "unavailable".
...

This is taking up my already few scratch tapes too fast...

(By the way: What is the correct way to turn these tapes into scratch
tapes again? move data into the storagepool, and then manually turn them
to scratch-status again? or do i miss something there?)

Tests done with the Dell-provided software tools have not given any hint
on problems occuring in the library. I have now been advised to swap the
two drives (one becomes two and vice versa) and see if the problem
persists - but my problem now is that I would not know how to do
reasonable testing.

I don,t know how to trigger migration process manually (searched the
admin ref pdf to no avail),
but even if I did, there is only so many migrations I can do, and then I
am stumped.

I don,t know how to advise TSM to move data using a defined drive - say,
from 1 to 2, and not from 2 to one - how can this be done?

I am quite limited in space and don,t have copy pools, so I am a bit
hesitant to move data with a possibly faulty drive and breaking
something - is there a clever approach doing a copy onto a single tape
volume? Again, I checked the guide but did not see anything wrt this...

Any input would be very much appreciated,

greets, tobi

--
----------------------------------------------------------------------
Dipl.-Ing. Tobias Hofmann   Bauhaus-Universitaet Weimar  D99423 Weimar
Professur fuer Graphische Datenverarbeitung      Projekt medienquadrat
SnailMail:  Bauhaus-Universitaet  Weimar,  Fak. Medien,  D99421 Weimar
Location:     D99423 Weimar     Karl-Haussknechtstr. 7      Zimmer 111
                Fon: ++49-(0)3643-58-3780  Fax : -3701
          e-mail: mailto:tobias.hofmann AT medien.uni-weimar DOT de
----------------------------------------------------------------------

<Prev in Thread] Current Thread [Next in Thread>