Networker

Re: [Networker] backing up large files, nsrexec get timeouts

2006-04-19 16:50:26
Subject: Re: [Networker] backing up large files, nsrexec get timeouts
From: Stan Horwitz <stan AT TEMPLE DOT EDU>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date: Wed, 19 Apr 2006 16:45:53 -0400
On Apr 19, 2006, at 7:55 AM — 4/19/06, Marcelo Bartsch wrote:

Hi people, today , for the last week we are unable to backup a client
with a file system of 240GB over 100mb WAN link. always i got
connections timed out, i test the link doing a a ftp file transfer from the host to the storage node, and it works, and the ftp was with 5 80GB
files each , wich i was also unable to backup with networker.

server = Solaris 9
SN = RedHat ES 3
client = Solaris 9

There are several avenues to pursue. First, make sure you have a high enough value in the "inactivity timeout" value in the group resource that starts this client's backup. This may be the only change you need to make .. if you're lucky.

You may also need to adjust the TCP/IP parameters on the NetWorker client and server but that can get pretty tricky. If your data goes to your storage node, you would need to set similar values there. On the Solaris NetWorker backup server that I run, I get good results with the following settings:

/usr/sbin/ndd -set /dev/tcp tcp_time_wait_interval 30000
/usr/sbin/ndd -set /dev/tcp tcp_conn_req_max_q 1024
/usr/sbin/ndd -set /dev/tcp tcp_keepalive_interval 600000
/usr/sbin/ndd -set /dev/tcp tcp_strong_iss 2
/usr/sbin/ndd -set /dev/tcp tcp_smallest_anon_port 8192
/usr/sbin/ndd -set /dev/tcp tcp_deferred_ack_interval 5
/usr/sbin/ndd -set /dev/tcp tcp_ip_abort_cinterval 10000
/usr/sbin/ndd -set /dev/tcp tcp_rexmit_interval_initial 500
/usr/sbin/ndd -set /dev/tcp tcp_ip_abort_interval 60000

If I remember correctly, the most important parameter for backup purposes is keepalive_interval.

If a firewall sits between the client and server, you may need to make adjustments on how the firewall is configured. You may also need to make adjustments on your NetWorker server and client by setting the

NSR_KEEPALIVE_WAIT=43

I am using a value of 43, but some other value might be more beneficial in your environment.

To sign off this list, send email to listserv AT listserv.temple DOT edu and type 
"signoff networker" in the
body of the email. Please write to networker-request AT listserv.temple DOT edu 
if you have any problems
wit this list. You can access the archives at 
http://listserv.temple.edu/archives/networker.html or
via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER

<Prev in Thread] Current Thread [Next in Thread>