Networker

Re: [Networker] eject/unload/etc. timeouts for SDLT in STK jukebox

2003-10-30 07:25:38
Subject: Re: [Networker] eject/unload/etc. timeouts for SDLT in STK jukebox
From: Scott Clous <scott.clous AT CCCI DOT ORG>
To: NETWORKER AT LISTMAIL.TEMPLE DOT EDU
Date: Thu, 30 Oct 2003 07:25:22 -0500
try ejecting them one at a time?

It could also be that you have a bad drive.  I suspected that but in a similar 
situation didn't track which unit consistently had problems... once I did it 
was obvious (and you could review your logs) that the drive was the problem, 
not legato.



-----Original Message-----
From: Legato NetWorker discussion
[mailto:NETWORKER AT LISTMAIL.TEMPLE DOT EDU]On Behalf Of Tim Mooney
Sent: Wednesday, October 29, 2003 6:02 PM
To: NETWORKER AT LISTMAIL.TEMPLE DOT EDU
Subject: [Networker] eject/unload/etc. timeouts for SDLT in STK jukebox


All-

Since switching to SDLT1 drives from DLT7000 last year, we've had
intermittent problems with ejecting tapes.  We initiate backups using
the `savegrp' command from a script run via cron.  As soon as the last
savegroup finishes, the script executes:

        $nsrbindir/nsrjb -u 2>/dev/null || true

Occassionally, instead of all drives ejecting and unloading correctly,
we'll have one drive "at random" fail to eject, and then we get repeated
messages like:

10/23/03 14:45:24 nsrd: media warning: /dev/nst0 opening: Device or resource 
busy
10/23/03 14:47:25 nsrd: media info: unload error for jukebox `STK9710' 
detected.  Retrying
10/23/03 14:47:25 nsrd: media info: unload retry for jukebox `STK9710': 
sleeping 30 seconds
10/23/03 14:47:56 nsrd: media info: unload retry for jukebox `STK9710' failed - 
will retry again.



And then we get this cycle of the last three messages, repeated for
several minutes, followed by:

10/23/03 14:57:45 nsrd: media info: unload retry for jukebox `STK9710' failed - 
time limit exceeded: aborting.


As soon as this cycle completes, I can use the GUI to select the drive that
is failing to unload, click the "Unload" button, and the drive unloads and
the tape is returned to its source slot without trouble.

I've tried increasing the eject/unload (as well as load/deposit/withdraw,
though we haven't had any problems there) timeouts in the Media->Jukeboxes
screen of the GUI, but if anything the problems have gotten worse, not
better.

Anyone out there using SDLT1 drives in an STK jukebox that would care to
share what they're using for eject/unload timeouts for their jukebox?

Thanks!

Tim
--
Tim Mooney                              mooney AT dogbert.cc.ndsu.NoDak DOT edu
Information Technology Services         (701) 231-1076 (Voice)
Room 242-J6, IACC Building              (701) 231-8541 (Fax)
North Dakota State University, Fargo, ND 58105-5164

--
Note: To sign off this list, send a "signoff networker" command via email
to listserv AT listmail.temple DOT edu or visit the list's Web site at
http://listmail.temple.edu/archives/networker.html where you can
also view and post messages to the list.
=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=

--
Note: To sign off this list, send a "signoff networker" command via email
to listserv AT listmail.temple DOT edu or visit the list's Web site at
http://listmail.temple.edu/archives/networker.html where you can
also view and post messages to the list.
=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=