Networker

Re: [Networker] nsrjobd Jobs error:

2008-06-11 11:29:03
Subject: Re: [Networker] nsrjobd Jobs error:
From: Fazil Saiyed <Fazil.Saiyed AT ANIXTER DOT COM>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date: Wed, 11 Jun 2008 10:21:18 -0500
Hello,
Just a couple of observations:
Could you run the NetWorker services and/or the group that is failing with the 
jobs error at debug level 9 (-D 9)? (Configure the NetWorker services with 
debug on.)
You can also get similar results if you run the group from the command line 
with extra -v switches.
Have you considered going to SP2 for Windows?
What are your jobs db size and retention (server attributes under Legato)? 
Please see the release notes for 7.4.2 and adjust.
Do you do periodic index crosschecks and NetWorker maintenance (nsrck, 
nsrim -X, etc.)?
Thanks







dd1980 <networker-forum AT BACKUPCENTRAL DOT COM> 
Sent by: EMC NetWorker discussion <NETWORKER AT LISTSERV.TEMPLE DOT EDU>
06/11/2008 09:32 AM
Please respond to: NETWORKER AT LISTSERV.TEMPLE DOT EDU
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Subject: [Networker] nsrjobd Jobs error:

Hi Guys 

I am running NetWorker 7.4.2 on a Windows 2003 SP1 server, backing up to a 
STK L180 library with 3 tape drives.

The backups seem to hang every night on the following message:

nsrjobd Jobs error: Unable to find record for job 32327 during an attempt 
to send message to it.

From the daemon.log I can see the following messages appear before the one 
above:

4154 10/06/2008 20:36:52  nsrmmd#1 Lost connection to Media Database
39078 10/06/2008 20:36:57  savegrp RPC error: Connection lost with server
7087 10/06/2008 20:36:57  savegrp Lost channel with the server (nsrjobd)
32490 10/06/2008 20:36:57  savegrp group Live-Misc-Daily  aborted.
39078 10/06/2008 20:36:59  savegrp RPC error: Connection lost with server
7087 10/06/2008 20:36:59  savegrp Lost channel with the server (nsrjobd)
32490 10/06/2008 20:36:59  savegrp group Live-Exchange-Daily aborted.
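
To see which component lost its connection first, an excerpt like the one above can be parsed and ordered by timestamp. The following is only a rough sketch: the line layout (message ID, DD/MM/YYYY date, time, source, text) is inferred from these seven lines rather than from any NetWorker documentation, and the names `parse_line` and `events_before_aborts` and the 30-second window are made up for illustration.

```python
import re
from datetime import datetime, timedelta

# Assumed layout, inferred from the excerpt only:
# <message id> <DD/MM/YYYY> <HH:MM:SS>  <source> <text>
LINE_RE = re.compile(
    r"^(?P<msg_id>\d+)\s+(?P<date>\d{2}/\d{2}/\d{4})\s+"
    r"(?P<time>\d{2}:\d{2}:\d{2})\s+(?P<source>\S+)\s+(?P<text>.*)$"
)

SAMPLE = """\
4154 10/06/2008 20:36:52  nsrmmd#1 Lost connection to Media Database
39078 10/06/2008 20:36:57  savegrp RPC error: Connection lost with server
7087 10/06/2008 20:36:57  savegrp Lost channel with the server (nsrjobd)
32490 10/06/2008 20:36:57  savegrp group Live-Misc-Daily  aborted.
39078 10/06/2008 20:36:59  savegrp RPC error: Connection lost with server
7087 10/06/2008 20:36:59  savegrp Lost channel with the server (nsrjobd)
32490 10/06/2008 20:36:59  savegrp group Live-Exchange-Daily aborted.
"""


def parse_line(line):
    """Return a dict for one log line, or None if it does not match the layout."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    stamp = datetime.strptime(m["date"] + " " + m["time"], "%d/%m/%Y %H:%M:%S")
    return {
        "id": int(m["msg_id"]),
        "time": stamp,
        "source": m["source"],
        "text": m["text"],
    }


def events_before_aborts(lines, window_seconds=30):
    """Pair each 'aborted' line with the events logged shortly before it."""
    events = [e for e in (parse_line(l) for l in lines) if e is not None]
    result = []
    for i, event in enumerate(events):
        if "aborted" in event["text"]:
            start = event["time"] - timedelta(seconds=window_seconds)
            earlier = [p for p in events[:i] if p["time"] >= start]
            result.append((event["text"], earlier))
    return result


if __name__ == "__main__":
    for abort_text, earlier in events_before_aborts(SAMPLE.splitlines()):
        print(abort_text)
        for e in earlier:
            print("   ", e["time"].time(), e["source"], e["text"])
```

On this excerpt the earliest event in each window is the nsrmmd#1 message at 20:36:52, five seconds before the first savegrp error.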

I have engaged EMC on the matter, but so far they haven't been able to 
pinpoint the cause of the problem.

So far I have done the following:

Stopped NW services and renamed the /nsr/tmp and /nsr/res/jobsdb 
directories.
Looked through DNS and /etc/hosts for erroneous entries
Disabled strong authentication
Reduced savegroup parallelism (in case server was overworked)
Reduced server parallelism (in case server was overworked)
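
For the hosts-file check in the list above, one kind of erroneous entry that is easy to miss by eye is a name listed under two different addresses. The sketch below flags those. It assumes the usual hosts-file layout (address, then one or more names, with `#` starting a comment); the host names and the 192.0.2.x addresses in the sample are invented placeholders, not taken from this server.

```python
from collections import defaultdict

# Hypothetical hosts-file content, for illustration only.
SAMPLE_HOSTS = """\
127.0.0.1    localhost
192.0.2.10   backupsrv backupsrv.example.com   # NetWorker server
192.0.2.20   exch01
192.0.2.99   BackupSrv                         # stale entry
"""


def parse_hosts(text):
    """Map each lower-cased name or alias to the set of addresses it is listed under."""
    names = defaultdict(set)
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        address, *hostnames = line.split()
        for name in hostnames:
            names[name.lower()].add(address)
    return names


def conflicting_names(text):
    """Return only the names that resolve to more than one address."""
    return {
        name: sorted(addresses)
        for name, addresses in parse_hosts(text).items()
        if len(addresses) > 1
    }


if __name__ == "__main__":
    for name, addresses in conflicting_names(SAMPLE_HOSTS).items():
        print(name, "->", ", ".join(addresses))
```

A name that legitimately has both an IPv4 and an IPv6 entry (localhost, for example) would also be reported, so the output is a list of things to look at rather than a list of errors.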

It is also worth noting that antivirus does not run at that time.
All NIC and switch ports are set to 1 Gb full duplex.
All NIC and teaming software is running on the latest HP drivers.

I feel like I have exhausted all avenues.

Does anyone have any ideas, or is anyone suffering from the same issue?

+----------------------------------------------------------------------
|This was sent by daniel.damico AT glasshouse DOT com via Backup Central.
|Forward SPAM to abuse AT backupcentral DOT com.
+----------------------------------------------------------------------

To sign off this list, send email to listserv AT listserv.temple DOT edu and 
type 
"signoff networker" in the body of the email. Please write to 
networker-request AT listserv.temple DOT edu if you have any problems with this 
list. You can access the archives at 
http://listserv.temple.edu/archives/networker.html or
via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER



