Networker

Re: [Networker] L700 goes offline - never returns

2010-11-24 05:33:53
Subject: Re: [Networker] L700 goes offline - never returns
From: Rachel Polanskis <R.Polanskis AT UWS.EDU DOT AU>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date: Wed, 24 Nov 2010 10:31:48 +0000
On 24/11/2010, at 8:14 PM, "Davina Treiber" <Davina.Treiber AT PeeVRo.co DOT 
uk> wrote:

> On 11/24/10 00:16, Rachel Polanskis wrote:
>> Hi,
>> We have an L700E attached to a Sun T5220, that has mostly been working OK.
>> 
>> Last week, the system went offline with the error:
>> 
>> 39078:nsrjb: RAP error: No jukeboxes are currently usable.
>> 
>> So I killed off the evil NetWorker 7.6.Build.142 software, cleaned out
>> /nsr/tmp and restarted with the library coming back "as advertised".
>> 
>> Last night, it decided to ruin my evening by doing it again.
>> 
>> This time, no matter what I do, I cannot get the system online again.
>> 
>> 
>> I have rebooted the server/cleared /nsr/tmp and Powered off/IPL the
>> library several times now.
>> 
>> 
>> The system will not come up.     This library is 50Km from where I sit,
>> so I cannot just go in and get this checked out easily.   I am getting
>> someone to check the CAP's are closed, but I am certain they are.
>> 
>> No matter what I  try, we always get nsrjb: RAP error: No jukeboxes are
>> currently usable
>> 
>> Can someone please let me know if they have had a similar issue.
>> 
>> We used to just run nsrjb -HH when this happened but it is not working
>> for us at all.   No nsrjb commands will pass the RAP error.
>> 
>> So, before I run screaming into the night with this frustrating problem,
>> please help!    I can supply info as required but I am totally stuck.
> 
> You need to diagnose this at the storage node. Login and run sjirdtag
> b.t.l (or one of the other low level commands). If this is Solaris 10
> you will need to stop NetWorker on the storage node first because the
> nsrlcpd process will lock the device.
> 
> If the library reports the list of drives/slots etc. then the problem is
> not at a hardware level. You may find that the device name has changed,
> in which case you need to change the address in the library config, and
> it means you also need to implement persistent binding if you haven't
> already done so.

Hi Davina,
sjirdtag gives us an io error when it is executed.  It does look like a 
mechanical fault
with the hand "grabber" inside the L700.....

Thanks!   

To sign off this list, send email to listserv AT listserv.temple DOT edu and 
type "signoff networker" in the body of the email. Please write to 
networker-request AT listserv.temple DOT edu if you have any problems with this 
list. You can access the archives at 
http://listserv.temple.edu/archives/networker.html or
via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER