ADSM-L

Re: [ADSM-L] Fw: ANR8943E Hardware or media error on library CDLA_PROD

2008-01-31 15:03:39
Subject: Re: [ADSM-L] Fw: ANR8943E Hardware or media error on library CDLA_PROD
From: Joni Moyer <joni.moyer AT HIGHMARK DOT COM>
To: ADSM-L AT VM.MARIST DOT EDU
Date: Thu, 31 Jan 2008 14:46:31 -0500
Thanks Nicholas!

The command: checkout libvol cdla_prod pa0319 rem=bulk checkl=n was
successful!

I re-ran the audit of cdla_prod and it completed successfully!

********************************
Joni Moyer
Highmark
Storage Systems, Storage Mngt Analyst III
Phone Number: (717)302-9966
Fax: (717) 302-9826
joni.moyer AT highmark DOT com
********************************



"Nicholas Cassimatis" <nickpc AT US.IBM DOT COM>
Sent by: "ADSM: Dist Stor Manager" <ADSM-L AT VM.MARIST DOT EDU>
01/31/2008 10:52 AM
Please respond to
"ADSM: Dist Stor Manager" <ADSM-L AT VM.MARIST DOT EDU>


To
ADSM-L AT VM.MARIST DOT EDU
cc

Subject
Fw: ANR8943E Hardware or media error on library CDLA_PROD






Isn't Element 257 normally a tape drive element number?  So the library is
failing to mount a tape (which has already been established as bad).  With
a physical library, you would want to remove the volume with the checkout
-
I would think you'd use "remove=bulk" but with the volser showing up as
"NOT KNOWN" I'm not sure if the library will find it.  "checklabel=no"
would probably be recommended, since mounting to read the label is what is
giving you the error.

Nick Cassimatis

----- Forwarded by Nicholas Cassimatis/Raleigh/IBM on 01/31/2008 10:47 AM
-----

"ADSM: Dist Stor Manager" <ADSM-L AT VM.MARIST DOT EDU> wrote on 01/31/2008
10:38:36 AM:

> Thanks Richard,
>
> I was able to find that PA0319 is in slot-element 1344, but I'm not sure
> how to see what is in slot-element 257?  The virtual volume PA0319 was
one
> of the volumes that we were told is damaged and unrecoverable.  In order
> to remove this volume from CDLA_PROD which is defined as a SCSI library
> would I just be able to checkout libvol cdla_prod pa0319, but would I
want
> remove=yes, bulk, or no for a virtual 3984 library? It doesn't look like
> it uses the checkl parameter for this type of library, but would I want
> force=yes or force=no?  Or do I not want to check the volume out at all?
I
> was thinking if I did I could then re-run the audit to see if it runs
> clean.
>
> You are correct.  After this fiasco I do wonder if this virtual library
is
> stable at all.  EMC has stated that it is, but from our 20+ hour outage
> and data loss you can see why I am not too confident in their ability or
> their word on this.
>
> I do have mounts/dismounts occurring for this library and data is being
> written to the virtual volumes.  I'm just wondering if it's having
issues
> with this 1 bad virtual volume...
>
> If anyone has any thoughts/ideas I'm open to suggestion!  EMC hasn't
been
> any help at this point.  Thanks again!
>
> Date/Time             Message
> --------------------
> ----------------------------------------------------------
> 01/31/08 09:31:31     ANR0984I Process 5034 for AUDIT LIBRARY started in
> the
>                        BACKGROUND at 09:31:31. (SESSION: 53460, PROCESS:
> 5034)
> 01/31/08 09:31:31     ANR8457I AUDIT LIBRARY: Operation for library
> CDLA_PROD
>                        started as process 5034. (SESSION: 53460,
PROCESS:
> 5034)
> 01/31/08 09:31:31     ANR0609I AUDIT LIBRARY started as process 5034.
> (SESSION:
>                        53460, PROCESS: 5034)
> 01/31/08 09:32:16     ANR8943E Hardware or media error on library
> CDLA_PROD
>                        (OP=00006C03, CC=-1, KEY=04, ASC=40, ASCQ=02,
> SENSE=F0.0
>  0.04.00.00.00.00.0C.00.00.00.00.40.02.00.00.00.00.00.00.,
>                        Description=An undetermined error has occurred).
> Refer
>                        to Appendix D in the 'Messages' manual for
> recommended
>                        action. (SESSION: 53460, PROCESS: 5034)
> 01/31/08 09:32:16     ANR8942E Could not move volume NOT KNOWN from
> slot-element
>                        1344 to slot-element 257. (SESSION: 53460,
PROCESS:
> 5034)
> 01/31/08 09:32:18     ANR8460E AUDIT LIBRARY process for library
CDLA_PROD
>
>                        failed. (SESSION: 53460, PROCESS: 5034)
>
> ********************************
> Joni Moyer
> Highmark
> Storage Systems, Storage Mngt Analyst III
> Phone Number: (717)302-9966
> Fax: (717) 302-9826
> joni.moyer AT highmark DOT com
> ********************************
>
>
>
> "Richard Sims" <rbs AT BU DOT EDU>
> Sent by: "ADSM: Dist Stor Manager" <ADSM-L AT VM.MARIST DOT EDU>
> 01/31/2008 10:01 AM
> Please respond to
> "ADSM: Dist Stor Manager" <ADSM-L AT VM.MARIST DOT EDU>
>
>
> To
> ADSM-L AT VM.MARIST DOT EDU
> cc
>
> Subject
> Re: ANR8943E Hardware or media error on library CDLA_PROD
>
>
>
>
>
>
> On Jan 31, 2008, at 9:39 AM, Joni Moyer wrote:
>
> > ...
> >  Does anyone know where I can find those codes?
> > ...
> >                        (OP=00006C03, CC=-1, KEY=04, ASC=40, ASCQ=02,
> >
>
> Those codes are probably too general to be useful.  See IBM Technote
> 1253273, for example.
> A good EMC customer engineer may be able to make something out of the
> sense bytes, but that's probably a long shot.
>
> An EMC customer on the List may be able to offer some advice, but
> it's probably the case that the library is simply malfunctioning and
> not properly communicating with TSM.  Check your TSM paths status,
> for completeness.  There may be error conditions reflected in the
> library itself.
>
> Based upon the data loss your site experienced with that library,
> your management should question whether to continue depending upon
> it.  It seems rather shakey to me from that error.
>
>     Richard Sims

<Prev in Thread] Current Thread [Next in Thread>