ADSM-L

Re: [ADSM-L] ANR8943E Hardware or media error on library CDLA_PROD

2008-01-31 10:39:23
Subject: Re: [ADSM-L] ANR8943E Hardware or media error on library CDLA_PROD
From: Joni Moyer <joni.moyer AT HIGHMARK DOT COM>
To: ADSM-L AT VM.MARIST DOT EDU
Date: Thu, 31 Jan 2008 10:38:36 -0500
Thanks Richard,

I was able to find that PA0319 is in slot-element 1344, but I'm not sure
how to see what is in slot-element 257?  The virtual volume PA0319 was one
of the volumes that we were told is damaged and unrecoverable.  In order
to remove this volume from CDLA_PROD which is defined as a SCSI library
would I just be able to checkout libvol cdla_prod pa0319, but would I want
remove=yes, bulk, or no for a virtual 3984 library? It doesn't look like
it uses the checkl parameter for this type of library, but would I want
force=yes or force=no?  Or do I not want to check the volume out at all? I
was thinking if I did I could then re-run the audit to see if it runs
clean.

You are correct.  After this fiasco I do wonder if this virtual library is
stable at all.  EMC has stated that it is, but from our 20+ hour outage
and data loss you can see why I am not too confident in their ability or
their word on this.

I do have mounts/dismounts occurring for this library and data is being
written to the virtual volumes.  I'm just wondering if it's having issues
with this 1 bad virtual volume...

If anyone has any thoughts/ideas I'm open to suggestion!  EMC hasn't been
any help at this point.  Thanks again!

Date/Time             Message
--------------------
----------------------------------------------------------
01/31/08 09:31:31     ANR0984I Process 5034 for AUDIT LIBRARY started in
the
                       BACKGROUND at 09:31:31. (SESSION: 53460, PROCESS:
5034)
01/31/08 09:31:31     ANR8457I AUDIT LIBRARY: Operation for library
CDLA_PROD
                       started as process 5034. (SESSION: 53460, PROCESS:
5034)
01/31/08 09:31:31     ANR0609I AUDIT LIBRARY started as process 5034.
(SESSION:
                       53460, PROCESS: 5034)
01/31/08 09:32:16     ANR8943E Hardware or media error on library
CDLA_PROD
                       (OP=00006C03, CC=-1, KEY=04, ASC=40, ASCQ=02,
SENSE=F0.0
 0.04.00.00.00.00.0C.00.00.00.00.40.02.00.00.00.00.00.00.,
                       Description=An undetermined error has occurred).
Refer
                       to Appendix D in the 'Messages' manual for
recommended
                       action. (SESSION: 53460, PROCESS: 5034)
01/31/08 09:32:16     ANR8942E Could not move volume NOT KNOWN from
slot-element
                       1344 to slot-element 257. (SESSION: 53460, PROCESS:
5034)
01/31/08 09:32:18     ANR8460E AUDIT LIBRARY process for library CDLA_PROD

                       failed. (SESSION: 53460, PROCESS: 5034)

********************************
Joni Moyer
Highmark
Storage Systems, Storage Mngt Analyst III
Phone Number: (717)302-9966
Fax: (717) 302-9826
joni.moyer AT highmark DOT com
********************************



"Richard Sims" <rbs AT BU DOT EDU>
Sent by: "ADSM: Dist Stor Manager" <ADSM-L AT VM.MARIST DOT EDU>
01/31/2008 10:01 AM
Please respond to
"ADSM: Dist Stor Manager" <ADSM-L AT VM.MARIST DOT EDU>


To
ADSM-L AT VM.MARIST DOT EDU
cc

Subject
Re: ANR8943E Hardware or media error on library CDLA_PROD






On Jan 31, 2008, at 9:39 AM, Joni Moyer wrote:

> ...
>  Does anyone know where I can find those codes?
> ...
>                        (OP=00006C03, CC=-1, KEY=04, ASC=40, ASCQ=02,
>

Those codes are probably too general to be useful.  See IBM Technote
1253273, for example.
A good EMC customer engineer may be able to make something out of the
sense bytes, but that's probably a long shot.

An EMC customer on the List may be able to offer some advice, but
it's probably the case that the library is simply malfunctioning and
not properly communicating with TSM.  Check your TSM paths status,
for completeness.  There may be error conditions reflected in the
library itself.

Based upon the data loss your site experienced with that library,
your management should question whether to continue depending upon
it.  It seems rather shakey to me from that error.

    Richard Sims

<Prev in Thread] Current Thread [Next in Thread>