
[Networker] NW 7.5.1 Server on Win 2008 cluster - nsrd dies quickly and quietly

2009-08-04 11:40:51
From: John Hope-Bailie <johnhb AT DEMANDDATA.CO DOT ZA>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date: Tue, 4 Aug 2009 17:31:38 +0200
Hi,

 

I have a clustered NW 7.5.1 server running on a Win 2008 cluster.  The
client deployment is still underway but for the past month everything
has run well.

 

Suddenly the NW server died.  Closer examination shows that when the
nsrd service is started using cluster manager, it runs for a short
while (say 20 seconds) and then dies.  It can be seen to be using a
bit of memory while it starts up, but it writes no messages or entries
to the daemon log at all.

 

We have cleared out the temp folders, cleared away the existing daemon
logs, and checked permissions on these folders, all without success.

 

We have also tried to start nsrd in debug mode (not sure if we got this
right) but nothing was logged.

 

If we delete the contents of the res folder (i.e. emulate a clean
install), the service starts up OK.

 

However, after running mmrecov from several points in the past, the
recovered versions of the res DBs still leave nsrd unable to start.

 

It appears that something is corrupted in the res DB, but if so, it
must have been like this for a while.

 

If so, why did nsrd not fail sooner?
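One way to look at the corruption theory from the file-system side is sketched below in Python.  It is only a sketch under assumptions: the folder passed in is whatever res folder the server actually uses (the path is not given here), the function name scan_resource_tree is made up for illustration, and a zero-length or recently changed file is just a hint, not proof of corruption.  It uses no NetWorker commands and only reads file metadata.

```python
import os
import sys
import time


def scan_resource_tree(root):
    """Walk root and return (empty_files, files_newest_first).

    Hypothetical helper: it only reads file sizes and modification
    times, and knows nothing about the NetWorker resource format.
    """
    empty = []
    entries = []
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            path = os.path.join(dirpath, name)
            st = os.stat(path)
            if st.st_size == 0:
                # A zero-length file is a candidate for closer inspection.
                empty.append(path)
            entries.append((st.st_mtime, path))
    # Newest first, so the files changed around the failure are on top.
    entries.sort(reverse=True)
    return sorted(empty), [path for _mtime, path in entries]


if __name__ == "__main__" and len(sys.argv) > 1:
    # The folder to scan is supplied by the caller; nothing is assumed
    # about where the res folder lives.
    empty_files, newest_first = scan_resource_tree(sys.argv[1])
    print("Zero-length files:")
    for path in empty_files:
        print("  " + path)
    print("Most recently modified files:")
    for path in newest_first[:10]:
        print("  %s  %s" % (time.ctime(os.path.getmtime(path)), path))
```

Running the same scan against a copy restored by mmrecov and against the current folder, then comparing the two listings, might show which files changed between a point where nsrd still ran and now.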

 

We have a case open with EMC, but if anyone knows how to fix this one,
please shed some light.

 

Regards,

 

John Hope-Bailie



 


To sign off this list, send email to listserv AT listserv.temple DOT edu and 
type "signoff networker" in the body of the email. Please write to 
networker-request AT listserv.temple DOT edu if you have any problems with this 
list. You can access the archives at 
http://listserv.temple.edu/archives/networker.html or
via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER