[Networker] BIT by occasional hang of nsrjb @ auto label/mount of new tape

2004-10-17 14:19:12
Subject: [Networker] BIT by occasional hang of nsrjb @ auto label/mount of new tape
From: Michael Coxe <mcoxe AT OPSWARE DOT COM>
To: NETWORKER AT LISTMAIL.TEMPLE DOT EDU
Date: Sun, 17 Oct 2004 11:18:19 -0700
More often than I would like (right now, for instance, while fulls are
in progress), the odd Legato-generated nsrjb command to label (read
prelabeled tape) & mount a tape for use by a waiting backup process hangs.
A simple "kill -15 PID" kills the hung nsrjb command, and the next
auto-generated nsrjb on the same tape usually succeeds.

Version: Networker 6.1.3 on Solaris 5.7 with a Spectralogic 12000
jukebox with AIT-2 drives.

legato # jbps
 UID   PID  PPID  C    STIME TTY     TIME    CMD
root 20922 24354 46 21:18:03 ?     733:08    /usr/sbin/nsrjb -j Gator -Y -O273 -L -B -m -G -M -bNDMP Pool -S 18

Any known cause of this problem?

And is there a way to set Networker to time out these obviously hung
processes and either restart them or move on to another slot?  Hacks or
workarounds gratefully accepted too.
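
Failing a proper timeout setting, one stopgap would be a cron-driven
watchdog that automates the manual "kill -15" described above. What follows
is a rough, untested sketch, not a Networker feature. It assumes a
POSIX-style ps that accepts -o pid=, -o etime= and -o comm= (not verified on
Solaris 5.7). The names MAX_SECONDS, etime_to_seconds, find_hung and the
--kill switch are made up for illustration, and the 30-minute limit is a
guess.

```shell
#!/bin/sh
# Hypothetical watchdog sketch: SIGTERM any nsrjb process whose elapsed
# time exceeds MAX_SECONDS. All names and the threshold are assumptions.

MAX_SECONDS=${MAX_SECONDS:-1800}   # assumed limit: 30 minutes

# Convert ps "etime" format ([[dd-]hh:]mm:ss) to a number of seconds.
etime_to_seconds() {
    echo "$1" | awk '{
        days = 0
        n = split($1, d, "-")
        if (n == 2) { days = d[1]; t = d[2] } else { t = d[1] }
        m = split(t, p, ":")
        if (m == 3) secs = p[1] * 3600 + p[2] * 60 + p[3]
        else        secs = p[1] * 60 + p[2]
        print days * 86400 + secs
    }'
}

# Read "PID ETIME COMMAND" lines on stdin and print the PIDs of nsrjb
# processes that have been running longer than MAX_SECONDS.
find_hung() {
    while read pid etime comm; do
        case "$comm" in
            *nsrjb*) ;;          # matches "nsrjb" and "/usr/sbin/nsrjb"
            *) continue ;;
        esac
        secs=`etime_to_seconds "$etime"`
        if [ "$secs" -gt "$MAX_SECONDS" ]; then
            echo "$pid"
        fi
    done
}

# Do nothing unless explicitly asked, so a dry run is the default.
if [ "${1:-}" = "--kill" ]; then
    for pid in `ps -e -o pid= -o etime= -o comm= | find_hung`; do
        kill -15 "$pid"          # same signal as the manual workaround
    done
fi
```

Run with --kill from cron every few minutes. Without that argument the
script kills nothing. Note that it cannot tell a hung nsrjb from one that is
legitimately slow, so the limit would need to sit well above the longest
normal label/mount time.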

This bites me at least twice a week.

 - michael

--
  Michael Coxe
  <mcoxe AT opsware DOT com>

