Networker

Re: [Networker] NetWorker appears stuck while checking an index?

2006-10-04 13:07:24
Subject: Re: [Networker] NetWorker appears stuck while checking an index?
From: George Sinclair <George.Sinclair AT NOAA DOT GOV>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date: Wed, 4 Oct 2006 12:55:44 -0400
This problem was resolved by first running nsr_shutdown again to safely kill the 'nsrck -ML1' process and bring down the other NetWorker processes. I noticed after I did this that the /nsr/logs/daemon.log file now contained not only the completed entry for the hung client but all subsequent entries as well, followed by
the completion line. So it seems as if maybe something was hung in queue?

Next, I renamed the affected index to name.old and re-started NetWorker. NetWorker processed all the other indexes happily and completed. Next, I renamed the index back again, shutdown and the re-started NetWorker - all OK. I also ran nsrck -L6 against the index. Everything looks good and it reported the same output that was reported in
the log prior. Not sure what happened, but it seems fixed.

George Sinclair wrote:

I restarted our NetWorker server, and /nsr/logs/daemon.log shows that it's been checking a particular client for
quite some time:

10/03/06 10:02:12 nsrck: checking index for 'clientname'

Prior to this, it had completing checking 10 other clients, including one whose index is much larger, but it has a ways to go since we have about 50+ clients. It's running the standard /usr/sbin/nsrck -ML1 now, and it's been running this for a while. Seems it should have completed by now, and should not be taking so long on this client. I'm thinking it's stuck.

I have no idea why it's stuck, but it appears to be. Here's what ps -ef | grep nsr shows:

   root 17911 17903  0 09:51:45 ?        1:52 /usr/sbin/nsrmmdbd
   root 17896     1  0 09:51:12 ?        0:02 /usr/sbin/nsrexecd
   root 17903     1  0 09:51:29 ?        0:28 /usr/sbin/nsrd
   root 17897 17896  0 09:51:13 ?        0:02 /usr/sbin/nsrexecd
root 17900 1 0 09:51:28 ? 0:02 /usr/sbin/lgtolmd -p /nsr/lic -n 1
   root 17914 17903  0 09:51:53 ?        0:00 /usr/sbin/nsrmmd -n 2
   root 17915 17903  0 09:51:56 ?        0:00 /usr/sbin/nsrmmd -n 3
   root 17913 17903  0 09:51:50 ?        0:00 /usr/sbin/nsrmmd -n 1
   root 17916 17903  0 09:51:58 ?        0:00 /usr/sbin/nsrmmd -n 4
   root 17912 17903  0 09:51:47 ?        0:00 /usr/sbin/nsrindexd
   root 17922 17912  0 09:52:06 ?        0:01 /usr/sbin/nsrck -ML1

What should I do?

George



--
George Sinclair - NOAA/NESDIS/National Oceanographic Data Center
SSMC3 4th Floor Rm 4145       | Voice: (301) 713-3284 x210
1315 East West Highway        | Fax:   (301) 713-3301
Silver Spring, MD 20910-3282  | Web Site:  http://www.nodc.noaa.gov/
- Any opinions expressed in this message are NOT those of the US Govt. -
To sign off this list, send email to listserv AT listserv.temple DOT edu and type 
"signoff networker" in the
body of the email. Please write to networker-request AT listserv.temple DOT edu 
if you have any problems
wit this list. You can access the archives at 
http://listserv.temple.edu/archives/networker.html or
via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER

<Prev in Thread] Current Thread [Next in Thread>