[Networker] Problem with nsrjb processes

2003-05-25 08:50:21
Subject: [Networker] Problem with nsrjb processes
From: Stan Horwitz <stan AT TEMPLE DOT EDU>
To: NETWORKER AT LISTMAIL.TEMPLE DOT EDU
Date: Sun, 25 May 2003 07:44:57 -0400
About a week ago, I migrated our Legato server here from a
system running Tru64 Unix 4.0f with NetWorker Power Edition 6.1.1 to Power
Edition 6.1.3 under Solaris 9 on a Sun Enterprise 450. For a variety of
reasons, this migration has been very painful. Now, for the past three days,
I have had a problem where nsrjb processes seem to hang. No tapes are
mounted, none are unmounted, yet there are numerous tapes that are eligible
for use, at least 15 of which are brand new. The problem seems to start at
about 8:00pm. Until then, backups run fine and tapes mount and unmount
fine. Then the system slows to a crawl where Legato is concerned, such as
taking several minutes to open up nsrwatch.

This is with a 600-slot Qualstar 412600 tape library that has six AIT-2 tape
drives in the left side and six new ones that I am about to configure (but
haven't yet) in the right side. I opened a call with Legato about this
problem on Thursday morning, but Legato has not provided a solution yet,
although they have requested lots of daemon.log and debugging data. So far,
they say they see nothing wrong with the debug data, but the debug session
was run during the day when this situation does not happen.

I am wondering if anyone on this list has any suggestions on how I might deal
with this problem.

When this happens, system utilization seems normal, as in this "top"
display:

last pid: 11172;  load averages:  1.54,  1.53,  1.62
458 processes: 442 sleeping, 2 running, 13 zombie, 1 on cpu
CPU states:  0.0% idle, 76.9% user, 23.1% kernel,  0.0% iowait,  0.0% swap
Memory: 1024M real, 372M free, 505M swap in use, 2220M swap free

   PID USERNAME THR PRI NICE  SIZE   RES STATE    TIME    CPU COMMAND
 13845 root       1  20    0   77M   76M run     21.2H 61.11% nsrd
 13851 root       1   3   10   46M   45M run    529:19 27.03% nsrmmdbd
 11166 root       1  39    0 3024K 1904K cpu      0:00  1.40% top
 13859 root       2  59  -15   14M 9800K sleep   53:25  1.33% nsrmmd
 13839 root       1  59    0 5408K 4000K sleep    1:38  0.06% nsrexecd
 13838 root       1  59    0 4248K 2624K sleep    1:55  0.06% nsrexecd
  3821 root       1  59    0 3296K 2120K sleep    0:00  0.05% nsrexec
  7449 root       1  59    0 3296K 2120K sleep    0:00  0.04% nsrexec
   292 root       1  59    0 3536K 2008K sleep    0:24  0.03% sshd2
   428 root       1  59  -15 4992K 3752K sleep    0:00  0.02% nsrmmd
 22973 root       1  59    0 3296K 2120K sleep    0:01  0.02% nsrexec
 24242 root       1  59    0 3296K 2120K sleep    0:01  0.02% nsrexec
 23028 root       1  59    0 3296K 2120K sleep    0:01  0.02% nsrexec
 21304 root       1  59    0 3296K 2120K sleep    0:01  0.02% nsrexec
 21468 root       1  59    0 3296K 2120K sleep    0:01  0.02% nsrexec
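
One way to compare a snapshot like this against a daytime one is to total the
CPU column per command. The following is only a minimal Python sketch and is
not part of NetWorker: the function names cpu_by_command and busiest are
hypothetical, the sample rows are a subset copied from the display above, and
the row pattern assumes this exact "top" column layout.

```python
import re
from collections import defaultdict

# A subset of rows copied from the "top" display above, for illustration only.
SAMPLE = """\
 13845 root       1  20    0   77M   76M run     21.2H 61.11% nsrd
 13851 root       1   3   10   46M   45M run    529:19 27.03% nsrmmdbd
 11166 root       1  39    0 3024K 1904K cpu      0:00  1.40% top
 13859 root       2  59  -15   14M 9800K sleep   53:25  1.33% nsrmmd
   428 root       1  59  -15 4992K 3752K sleep    0:00  0.02% nsrmmd
"""

# Assumed row layout: PID first, then a CPU percentage followed by the
# command name at the end of the line. Header lines do not match.
ROW = re.compile(r"^\s*(\d+)\s+\S+\s+.*?\s(\d+(?:\.\d+)?)%\s+(\S+)\s*$")


def cpu_by_command(text):
    """Sum the CPU percentage of all rows that share a command name."""
    totals = defaultdict(float)
    for line in text.splitlines():
        match = ROW.match(line)
        if match:
            totals[match.group(3)] += float(match.group(2))
    return dict(totals)


def busiest(text, n=2):
    """Return the n command names with the highest summed CPU percentage."""
    totals = cpu_by_command(text)
    return sorted(totals, key=totals.get, reverse=True)[:n]


if __name__ == "__main__":
    totals = cpu_by_command(SAMPLE)
    for command in busiest(SAMPLE, 3):
        print(command, round(totals[command], 2))
```

Saving one snapshot during the day and one after 8:00pm, then running both
through the same function, would show whether the share taken by nsrd and
nsrmmdbd changes when the hang begins.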

