Networker

Re: [Networker] L700 goes offline - never returns

2010-11-23 21:14:42
Subject: Re: [Networker] L700 goes offline - never returns
From: Rachel Polanskis <r.polanskis AT UWS.EDU DOT AU>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date: Wed, 24 Nov 2010 13:13:54 +1100
On Wed, 24 Nov 2010, STANLEY R. HORWITZ wrote:

Rachel,

The following log entry says ...

Nov 24 11:51:17 nsrserverhost root: [ID 702911 daemon.notice] NetWorker
media: (info) Found the library `pta-L700e' already configured but
disabled!

So then the question is, why? I do not know HP-UX, but if you haven't
checked its system logs, do so. I have a feeling something is wrong at a
hardware level.

Yep, it's Solaris 10 actually!   But we also suspect a hardware failure,
but do not know if it is the emulex hba, the switch, the library or the hand!

We trolled the system logs eg debug logs and there is no relevant hardware error message anywhere.


Cheers.....




On 11/23/10 8:56 PM, "r.polanskis AT uws.edu DOT au" <r.polanskis AT uws.edu DOT 
au>
wrote:

On Wed, 24 Nov 2010, STANLEY R. HORWITZ wrote:

Rachel;

Can you log onto the storage node that has that tape library connected
to
it and see if there are any telltale errors in that system's log files?

There's nothing much in our debug logs at all - this is from a diagnostic
run this morning:

Nov 24 11:51:10 nsrserverhost root: [ID 702911 daemon.notice] NetWorker
media: (info) Searching for NetWorker SCSI devices on 'nsrserverhost'...
Nov 24 11:51:10 nsrserverhost root: [ID 702911 daemon.notice] NetWorker
media: (info) Starting dvdetect process on host 'nsrserverhost'...
Nov 24 11:51:16 nsrserverhost root: [ID 702911 daemon.notice] NetWorker
media: (info) Found new LTO Ultrium-3 device /dev/rmt/40cbn on host
<nsrserverhost>. (<HP      Ultrium 3-SCSI  L63S at SCSI Bus 4 Target 2
LUN 0>)
Nov 24 11:51:16 nsrserverhost root: [ID 702911 daemon.notice] NetWorker
media: (info) Found new LTO Ultrium-3 device /dev/rmt/41cbn on host
<nsrserverhost>. (<HP      Ultrium 3-SCSI  L63S at SCSI Bus 4 Target 3
LUN 0>)
Nov 24 11:51:16 nsrserverhost root: [ID 702911 daemon.notice] NetWorker
media: (info) Found new LTO Ultrium-3 device /dev/rmt/42cbn on host
<nsrserverhost>. (<HP      Ultrium 3-SCSI  L63S at SCSI Bus 4 Target 6
LUN 0>)
Nov 24 11:51:16 nsrserverhost root: [ID 702911 daemon.notice] NetWorker
media: (info) Found new LTO Ultrium-3 device /dev/rmt/44cbn on host
<nsrserverhost>. (<HP      Ultrium 3-SCSI  L63S at SCSI Bus 5 Target 1
LUN 0>)
Nov 24 11:51:16 nsrserverhost root: [ID 702911 daemon.notice] NetWorker
media: (info) Found new LTO Ultrium-3 device /dev/rmt/43cbn on host
<nsrserverhost>. (<HP      Ultrium 3-SCSI  L6CS at SCSI Bus 5 Target 3
LUN 0>)
Nov 24 11:51:17 nsrserverhost root: [ID 702911 daemon.notice] NetWorker
media: (info) Found new LTO Ultrium-3 device /dev/rmt/45cbn on host
<nsrserverhost>. (<HP      Ultrium 3-SCSI  L63S at SCSI Bus 5 Target 4
LUN 0>)
Nov 24 11:51:17 nsrserverhost root: [ID 702911 daemon.notice] NetWorker
media: (info) Found the library `pta-L700e' already configured but
disabled!
Nov 24 11:51:17 nsrserverhost root: [ID 702911 daemon.notice] NetWorker
media: (info) Ending dvdetect process on host 'nsrserverhost'...
Nov 24 11:51:19 nsrserverhost root: [ID 702911 daemon.notice] NetWorker
media: (info) Finished searching for new backup devices.
Nov 24 11:51:19 nsrserverhost root: [ID 702911 daemon.notice] NetWorker
media: (info) Jukebox pta-L700e is now enabled.


Even after this the system is telling us it is "not ready"......    We
are suspecting a "hand issue" but I have to get someone
onsite to investigate!


Last night, when the hangup happened, there was no details in any of the
logs - it just hangs up :/



Thanks very much for looking.....




On 11/23/10 7:16 PM, "Rachel Polanskis" <r.polanskis AT UWS.EDU DOT AU> wrote:

Hi,
We have an L700E attached to a Sun T5220, that has mostly been working
OK.

Last week, the system went offline with the error:

39078:nsrjb: RAP error: No jukeboxes are currently usable.

So I killed off the evil NetWorker 7.6.Build.142 software, cleaned out
/nsr/tmp
and restarted with the library coming back "as advertised".

Last night, it decided to ruin my evening by doing it again.

This time, no matter what I do, I cannot get the system online again.


I have rebooted the server/cleared /nsr/tmp and Powered off/IPL the
library several times now.


The system will not come up.     This library is 50Km from where I sit,
so I
cannot just go in and get this checked out easily.   I am getting
someone
to check
the CAP's are closed, but I am certain they are.

No matter what I  try, we always get nsrjb: RAP error: No jukeboxes are
currently usable

Can someone please let me know if they have had a similar issue.

We used to just run nsrjb -HH when this happened but it is not
working for us at all.   No nsrjb commands will pass the RAP error.

So, before I run screaming into the night with this frustrating
problem,
please help!    I can supply info as required but I am totally stuck.


Cheers.

--
Rachel Polanskis        Systems Admin, University of Western Sydney
ADD Werrington North Campus    (+61 2) 9678 7291
<r.polanskis AT uws.edu DOT au>
   "The perversity of the Universe tends towards a maximum." -
Finagle's
Law

To sign off this list, send email to listserv AT listserv.temple DOT edu and
type "signoff networker" in the body of the email. Please write to
networker-request AT listserv.temple DOT edu if you have any problems with
this
list. You can access the archives at
http://listserv.temple.edu/archives/networker.html or
via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER



--
Rachel Polanskis        Systems Admin, University of Western Sydney
ADD Werrington North Campus    (+61 2) 9678 7291  <r.polanskis AT uws.edu DOT 
au>
   "The perversity of the Universe tends towards a maximum." - Finagle's
Law



--
Rachel Polanskis                Systems Admin, University of Western Sydney
ADD Werrington North Campus     (+61 2) 9678 7291  <r.polanskis AT uws.edu DOT 
au>
   "The perversity of the Universe tends towards a maximum." - Finagle's Law

To sign off this list, send email to listserv AT listserv.temple DOT edu and type 
"signoff networker" in the body of the email. Please write to networker-request 
AT listserv.temple DOT edu if you have any problems with this list. You can access the 
archives at http://listserv.temple.edu/archives/networker.html or
via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER