ADSM-L

Re: LTO drive error on MOVE DRMEDIA

2006-02-23 20:44:20
Subject: Re: LTO drive error on MOVE DRMEDIA
From: Len Boyle <Len.Boyle AT SAS DOT COM>
To: ADSM-L AT VM.MARIST DOT EDU
Date: Thu, 23 Feb 2006 20:44:02 -0500
Andrew

For more information on the library errors, you can look in three (or four)
places if you have the latest library firmware, and one fewer if you do not.

With the latest firmware level, the web interface to the library will let you
view the library error log.
It is in the service group. You will need the Maintenance Information book,
which is downloadable from IBM's web site, to look up the error codes. There
is also a hard copy that came with the library.
If you do not have the latest firmware, you can view the error logs from the
front panel.
There is also an LCD display on the front of the LTO tape drives, which
displays a subset of errors.
And you can turn on SNMP traps to send error messages out to you.

I believe that IBM also added a feature where one can download the error logs, 
but I have not looked into that yet. 

Let us know what you find out. 

Regards, Len

-----Original Message-----
From: ADSM: Dist Stor Manager [mailto:ADSM-L AT VM.MARIST DOT EDU] On Behalf Of 
Andrew Ferris
Sent: Thursday, February 23, 2006 2:00 PM
To: ADSM-L AT VM.MARIST DOT EDU
Subject: [ADSM-L] LTO drive error on MOVE DRMEDIA

Hi *DSM-ers,

TSM Server 5.2.3.2 on Win 2000
3584 Tape Library with 4 LTO-2 drives

It's offsite backup day and our regularly scheduled MOVE DRMEDIA * WHERE 
STATE=MOUNTABLE... produced this error on one of our tapes:

02/23/2006 09:04:46   ANR6696I MOVE DRMEDIA: CHECKOUT LIBVOLUME for volume
                       ICA074L2 in library 3584 starting. (SESSION: 95513,
                       PROCESS: 3435)

02/23/2006 09:05:00   ANR8336I Verifying label of LTO volume ICA074L2 in drive
                       F1R3 (mt2.0.0.4). (SESSION: 95513, PROCESS: 3435)
02/23/2006 09:05:15   ANR8942E Could not move volume ICA074L2 from slot-element
                       259 to slot-element 774. (SESSION: 95513, PROCESS: 3435)
02/23/2006 09:05:15   ANR8418E CHECKOUT LIBVOLUME: An I/O error occurred while
                       accessing library 3584. (SESSION: 95513, PROCESS: 3435)
02/23/2006 09:05:15   ANR6698E MOVE DRMEDIA: CHECKOUT LIBVOLUME for volume
                       ICA074L2 in library 3584 failed. (SESSION: 95513,
                       PROCESS: 3435)

Slot-element 259 is one of the LTO-2 drives (F1R3, to be precise), and
slot-element 774 is in the 3584's I/O station. I re-ran the MOVE DRMEDIA
command for that specific tape, and it moved to the I/O station just fine,
though via another LTO drive.

A quick look at the TSM admin console (dsmadmc) showed this additional error on 
a subsequent DRM tape:

9:06:59 MOVE DRMEDIA: CHECKOUT LIBVOLUME for volume ICA051L2 in library 3584 starting.
ANR8302E I/O error on drive F1R3 (mt2.0.0.4) (OP=OFFL, Error Number=21, CC=0, KEY=02, ASC=04,
ASCQ=02,~SENSE=70.00.02.00.00.00.00.1C.00.00.00.00.04.02.30.00.10.12.00.00.00.00.20.20.20.20.20.20.20.00.00.00.00.00.00.13.00.00.00.14.FF.FF.FF.FF.6B.8F.00.00.6C.8F.00.00.6D.8F.00.00.6E.8F.00.00.6F.8F.00.00.70.8F.00.00.71.8F.00.00.72.8F.00.00.73.8F.00.00.74.8F.00.00.75.8F.00.00.76.8F.00.00.77.8F.00.00,~Description=An
undetermined error has occurred).  Refer to Appendix D in the 'Messages' manual
for recommended action.

So that's the same drive (F1R3). I've looked up ANR8302E in the
5.2 messages manual:

User Response: Ensure that the DEVICE parameter associated with the drive was 
identified correctly in the DEFINE PATH command, and that the device is 
currently powered on and ready. The drive or library reference manual provided 
with the device usually contain tables that explain the values of the KEY, ASC, 
and ASCQ fields. If the problem persists, contact your service representative 
and provide the internal code values and sense data from this message.

All four drives show as online in the 3584's IBM UltraScalable Specialist Web 
page and they also show as online in TSM.

I see KEY=02 equaling "not ready" in the messages manual. ASC=04 and
ASCQ=02 are "Not ready, initializing command required".
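
For anyone wanting to cross-check the KEY/ASC/ASCQ values against the raw
sense bytes, here is a minimal Python sketch. It assumes the SENSE field is
standard SCSI fixed-format sense data (response code 70h), where the sense key
is the low nibble of byte 2 and ASC/ASCQ are bytes 12 and 13. The function
name and the lookup tables are hypothetical and list only a few codes; the
drive or library manual remains the authority. The sample string is just the
first 18 bytes of the SENSE field from the ANR8302E message above.

```python
# Sense-key names from the SCSI standard (partial list, for illustration).
SENSE_KEYS = {
    0x0: "NO SENSE",
    0x1: "RECOVERED ERROR",
    0x2: "NOT READY",
    0x3: "MEDIUM ERROR",
    0x4: "HARDWARE ERROR",
    0x5: "ILLEGAL REQUEST",
    0x6: "UNIT ATTENTION",
}

# Only the ASC/ASCQ pair seen in this thread; the full table is in the
# drive reference manual.
ASC_ASCQ = {
    (0x04, 0x02): "Logical unit not ready, initializing command required",
}


def decode_fixed_sense(sense_text):
    """Decode dot-separated hex sense bytes as printed in ANR8302E.

    Hypothetical helper: assumes fixed-format sense data (70h or 71h).
    """
    data = [int(part, 16) for part in sense_text.strip().split(".") if part]
    if len(data) < 14:
        raise ValueError("need at least 14 sense bytes")
    if (data[0] & 0x7F) not in (0x70, 0x71):
        raise ValueError("not fixed-format sense data")
    key = data[2] & 0x0F          # sense key: low nibble of byte 2
    asc, ascq = data[12], data[13]
    return {
        "key": key,
        "asc": asc,
        "ascq": ascq,
        "key_text": SENSE_KEYS.get(key, "unknown"),
        "asc_text": ASC_ASCQ.get((asc, ascq), "see drive manual"),
    }


# First 18 bytes of the SENSE field from the message in this post.
sample = "70.00.02.00.00.00.00.1C.00.00.00.00.04.02.30.00.10.12"
decoded = decode_fixed_sense(sample)
print("KEY=%02X ASC=%02X ASCQ=%02X" % (decoded["key"], decoded["asc"], decoded["ascq"]))
print(decoded["key_text"], "/", decoded["asc_text"])
```

The decoded bytes agree with the KEY, ASC and ASCQ fields that TSM prints, so
the message fields and the raw sense data tell the same story here.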

This is the first tape error I've seen, so I'm a bit flummoxed as to where to
go next. Could someone please give me a bit of advice or context on these errors?

thanks,


Andrew Ferris
Network Support Analyst
iCAPTURE Research Centre
University of British Columbia
