Networker

Re: [Networker] Server lost connection problem

2012-06-19 14:03:53
Subject: Re: [Networker] Server lost connection problem
From: "Stanley R. Horwitz" <stan AT TEMPLE DOT EDU>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date: Tue, 19 Jun 2012 17:58:28 +0000
Hi Chester,

This seems to occur at different times of the day and night. I agree that 
increasing client parallelism doesn't make much sense, but perhaps it is one of 
those counterintuitive situations. The savegroup parallelism is set  to 10 for 
each savegroup. Auto media management is not enabled on my DD Boost devices, 
but nothing in the logs on the NetWorker server suggess a problem in that 
regard. I am going to ask my SAN manager to look at the DD system to try to 
ascertain if it is in good health, but the daily health report emails I get 
from it do not indicate any sort of a problem.

On 06 19, 2012, at 1:39 PM, Chester Martin wrote:

> Hello,
> At first glance you would think it'd be a timeout issue that adjusting the 
> keep alive values would fix.  Being that you just updated to a new DDOS you 
> may want to see if there are any errors being reported on the data domain 
> side.  Also, see if there is a certain time when these errors happen.  
> Meaning, are there clients that kick off at 5pm and the error happens at 6pm 
> and all the client's backup that was running at that time cancel with the 
> "connection dropped" error?  You didn't mention anything about the data 
> domain devices going offline, but if there was an issue with the networker 
> server not talking to data domain your devices should go offline.  But if you 
> have "auto media management" enabled on the dd devices networker will attempt 
> to bring them back online.
> 
> I would think that increasing the client parallelism would add to the problem 
> instead of help it.  Increasing the client parallelism will cause you to have 
> more streams going to the dd box, which may slow your backup down.  If any 
> parallelism needs to be adjusted I would do it from the group level and not 
> the client level.  Is it possible you have your group parallelism set to 0 
> and some of the clients could be waiting on resources and timing out? 

<Prev in Thread] Current Thread [Next in Thread>