Networker

Re: [Networker] 7.4.5 - rap error, jukebox not ready.

2009-09-08 19:58:06
Subject: Re: [Networker] 7.4.5 - rap error, jukebox not ready.
From: Rachel Polanskis <r.polanskis AT uws.edu DOT au>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date: Wed, 9 Sep 2009 09:53:20 +1000
On Tue, 8 Sep 2009, Khan, Sami wrote:

Thanks again Frank, I am definitely talking to EMC about this one.  Hanging 
savegroups in 7.4.4 were far better than this. I wonder if anyone else is going 
through this.


Hi,
we are running 7.4.2 on a Sun T5220 with an L700 and 6 LTO-3 drives attached.

What is happening in our case is that the system will just present with all the savegrp's waiting for tapes. The system will ask for a bunch of tapes repeatedly until there is intervention.

We have found that as soon as the nsrjb -HH command is elicited, the system will immediately wake up and start loading vols and processing backups.

After running nsrjb -HH again on Sunday afternoon:

Sun 16:03:21 media info: Jukebox `pta-L700e' Hardware status of jukebox 
'pta-L700e' changed from 'cannot
access the hardware' to 'ready'

The system was not scanning its tape drives before this.

If this helps - this log message tells us a time:

Sep  6 15:17:26 lamarr root: [ID 702911 daemon.notice] NetWorker media: 
(warning) Jukebox `pta-L700e'
Hardware status of jukebox 'pta-L700e' changed from 'ready' to 'cannot access 
the hardware'


This is starting to happen more often to the point of a couple of times a day 
(or worse, night)
which means the poor little backup admin struggles with sleep knowing there is a recalcitrant system online that really wants them to get up at 3:30AM and run nsrjb -HH to keep the backups going.

After a reboot, the system settles down for a few days before going AWOL again.


We have a V890 with 7.4.2 and an L700 with LTO-2 drives that does not have this problem very often - it's main issue is that it is getting slower and slower....


If someone has an answer to my problem, that would be very helpful!



rachel


-----Original Message-----
From: Francis Swasey [mailto:Frank.Swasey AT uvm DOT edu]
Sent: 08 September 2009 14:23
To: Khan, Sami
Cc: EMC NetWorker discussion
Subject: Re: [Networker] 7.4.5 - rap error, jukebox not ready.

On 9/8/09 7:54 AM, Khan, Sami wrote:
Thanks Frank. This only happens when I open the Mailbox door to deposit new tapes, we are 
using an ADIC Scalar 100 with three drives. Is there anyway I can make Networker realize 
when the doors are closed and bring the Scalar100 back to "Ready" state?

Sami,
  It's been over a year since I was working with EMC on this issue.  I have a 
Qualstar XLS
library.  There was one iteration of the code fix that did what you are 
describing.  With that
iteration of the fix, there was no way to get NetWorker to realize the jukebox 
was online
again.  I tested that attempted fix but did not put it in production.

  I'd say that you need to contact EMC about a fix -- and I'd say this is a 
severity 1.



--
Rachel Polanskis                Systems Admin, University of Western Sydney
ADD Werrington North Campus     (+61 2) 9678 7291  <r.polanskis AT uws.edu DOT 
au>
   "The perversity of the Universe tends towards a maximum." - Finagle's Law

To sign off this list, send email to listserv AT listserv.temple DOT edu and type 
"signoff networker" in the body of the email. Please write to networker-request 
AT listserv.temple DOT edu if you have any problems with this list. You can access the 
archives at http://listserv.temple.edu/archives/networker.html or
via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER