Networker

Re: [Networker] Maximum Active Devices error

2007-11-20 11:05:52
Subject: Re: [Networker] Maximum Active Devices error
From: Francis Swasey <Frank.Swasey AT UVM DOT EDU>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date: Tue, 20 Nov 2007 10:58:42 -0500
On 11/20/07 10:39 AM, Fazil.Saiyed AT anixter DOT com wrote:

Hello,
Can you please elaborate on what has happened since they were last working? What changed? Were any patches applied? What is your typical configuration? What happens when you try to enable them, and what are the errors? Can you check the daemon logs? Are the required daemons running? Are the drivers loaded? Can you include some of the important logs here?
There are lots of questions right now.
Thanks

The storage node was installed at the end of June and has always had two LTO-4 tape drives attached to it. Those two tape drives are in a 10-drive Qualstar XLS jukebox. The robotics are attached to the NW server; the 10 drives are spread in pairs across five RHEL 4 storage nodes.

Over the weekend, this storage node wound up with two tapes mounted and then, for an as-yet-undiscovered reason (i.e., there is nothing about any failure in the /nsr/logs/daemon.log file on the NW server), it decided that it was not allowed to have active devices. Those two tapes, which were required by running savegrp commands, became trapped.

During the triage process we had to shut down the nsr processes on this storage node to free the tapes, and of course it is working now -- but with no assurance that it will not happen again, since no one knows why it happened in the first place.

Yes, the nsrmmd processes were running (I would have been paged by our nagios monitoring system had the nsr processes not been running).

The daemon.log file contains nothing that relates to this storage node having any failure. The tapes were loaded into the drives on Saturday, and on Monday morning the following was recorded in the NW server's daemon.log:

11/19/07 04:37:14 nsrd: [Jukebox `Qualstar XLS', operation 246]. Initiated operation `/usr/sbin/nsrjb -s ozzie.uvm.edu -l -f rd=stornode8.uvm.edu:/dev/nst1 UVM.5073'.

11/19/07 04:37:17 nsrd: [Jukebox `Qualstar XLS', operation # 246]. Slot `152' is loaded into unavailable device `rd=stornode7.uvm.edu:/dev/nst0'. Reached the maximum number of active devices on storage node.

11/19/07 04:37:19 nsrd: [Jukebox `Qualstar XLS', operation # 246]. Finished with status: failed

UVM.5073 was one of the two tapes stuck in stornode7's tape drives.
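
For anyone chasing a similar failure, a short script can pull these entries out of daemon.log and group them by jukebox operation. The following is only a sketch: the regular expressions are inferred from the three log lines quoted above rather than from any documented NetWorker log format, and the function name find_max_device_failures is hypothetical.

```python
import re
from collections import defaultdict

# Pattern inferred from the sample lines above (an assumption, not a spec).
# Note that the operation number appears both as "operation 246" and
# "operation # 246", so the "# " is treated as optional.
LINE_RE = re.compile(
    r"^(?P<ts>\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}) nsrd: "
    r"\[Jukebox `(?P<jukebox>[^']+)', operation (?:# )?(?P<op>\d+)\]\. "
    r"(?P<msg>.*)$"
)
# A device name quoted as `rd=host:/dev/path' inside the message text.
DEVICE_RE = re.compile(r"`(rd=[^']+)'")
MAX_ACTIVE = "Reached the maximum number of active devices"


def find_max_device_failures(lines):
    """Return one record per jukebox operation that hit the device limit."""
    ops = defaultdict(list)
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m:
            ops[int(m.group("op"))].append((m.group("ts"), m.group("msg")))

    failures = []
    for op, entries in sorted(ops.items()):
        for ts, msg in entries:
            if MAX_ACTIVE in msg:
                dev = DEVICE_RE.search(msg)
                failures.append({
                    "operation": op,
                    "time": ts,
                    "device": dev.group(1) if dev else None,
                })
                break  # one record per operation is enough
    return failures


if __name__ == "__main__":
    # Path taken from the message above; adjust for your installation.
    with open("/nsr/logs/daemon.log", errors="replace") as log:
        for failure in find_max_device_failures(log):
            print(failure)
```

Run against the excerpt above, this would report operation 246 and the device named in the error, rd=stornode7.uvm.edu:/dev/nst0, which is not the device the load was requested for (rd=stornode8.uvm.edu:/dev/nst1).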

--
Frank Swasey                    | http://www.uvm.edu/~fcs
Sr Systems Administrator        | Always remember: You are UNIQUE,
University of Vermont           |    just like everyone else.
  "I am not young enough to know everything." - Oscar Wilde (1854-1900)

