2009-06-10 17:10:01
Subject: Re: [Networker] Networker 7.2.2 clients with large files problem
From: Greg Etling <getling AT STERN.NYU DOT EDU>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date: Wed, 10 Jun 2009 17:05:28 -0400
Update: I ran a backup of the closest server in question from the command line (savegrp -v -c ie-1 -l full GROUP) and received:

* ie-1:/yesterday lost connection to server, exiting

This is despite the fact that I'm seeing the keepalive notices in nwadmin every 5 minutes as expected.
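For anyone collecting these failures across several runs, here is a minimal sketch of pulling the failed save sets out of captured output. It assumes only the "* client:/path message" line shape quoted in this thread. The names parse_failures, lost_connection and SAMPLE are hypothetical, and SAMPLE simply combines two lines quoted in different messages of this thread rather than reproducing a real report. Save set paths containing spaces are not handled.

```python
import re

# Line shape assumed from the two failure lines quoted in this thread, e.g.
#   "* ie-1:/yesterday lost connection to server, exiting"
#   "* ie-1:/yesterday 1 retry attempted"
FAILURE_LINE = re.compile(
    r"^\*\s+(?P<client>[^:\s]+):(?P<saveset>\S+)\s+(?P<message>.+)$"
)


def parse_failures(report_text):
    """Return (client, saveset, message) tuples for every '* client:path ...' line."""
    failures = []
    for line in report_text.splitlines():
        match = FAILURE_LINE.match(line.strip())
        if match:
            failures.append(
                (
                    match.group("client"),
                    match.group("saveset"),
                    match.group("message").strip(),
                )
            )
    return failures


def lost_connection(failures):
    """Keep only the entries whose message reports a dropped server connection."""
    return [entry for entry in failures if "lost connection to server" in entry[2]]


# Hypothetical sample: both failure lines are copied from this thread,
# but they did not appear together in one report.
SAMPLE = """\
--- Unsuccessful Save Sets ---

* ie-1:/yesterday 1 retry attempted
* ie-1:/yesterday lost connection to server, exiting
"""

if __name__ == "__main__":
    for client, saveset, message in parse_failures(SAMPLE):
        print(client, saveset, message)
```

Separating the "lost connection" entries from plain retry notices makes it easier to see whether the scheduled and manual runs fail in the same way.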

Greg
---
Greg Etling <getling AT stern.nyu DOT edu>
Systems Administrator
Stern IT Enterprise Operations
NYU Stern School of Business

212-998-0746

Greg Etling wrote:
James,

These are not NDMP. Some go through a firewall and some do not: the ones that do are backing up over a VLAN from a datacenter ~1 mile away. However, the biggest issues have been with the server that is adjacent to the backup server.

To expand on NSR_KEEPALIVE_WAIT: it did appear to have an effect when I invoked savegrp manually (I saw the "group running on client" messages in nwadmin), but not when the group ran automatically. It was introduced because of the large files problem, which predated me and predated the firewall hardware. It probably even predated the datacenter move that separated our backup hardware from some of our systems.

David,

Thanks for the tip - I had bumped the inactivity timeout to 90, but I'll crank it all the way out. The savesets are listed as aborted, and often have significantly divergent sizes (>5 GB) among the retries, even though the server mountpoint is static throughout the day.

Greg

---
Greg Etling <getling AT stern.nyu DOT edu>
Systems Administrator
Stern IT Enterprise Operations
NYU Stern School of Business

212-998-0746

Browning, David wrote:
Check the "inactivity timeout" field in the group definitions.
For our large servers, we put this to 1000, and that seems to work for
us.
Also, check your indexes to make sure that it isn't really completing.
Sometimes you can get that message, but you will find that the backup
successfully finished, and is stored in the index.

David M. Browning Jr.
IT Project Coordinator Enterprise Backups and Help Desk

James T Proctor wrote:

Are these ndmp? Are they going through a firewall?

Jim Proctor
IT Specialist
USGS/NGTOC III
Rolla, Missouri
jproctor AT usgs DOT gov
(573)308-3521




From:     Greg Etling <getling AT STERN.NYU DOT EDU>
To:     NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date:     06/10/2009 10:59 AM
Subject: [Networker] Networker 7.2.2 clients with large files problem
Sent by:     EMC NetWorker discussion <NETWORKER AT LISTSERV.TEMPLE DOT EDU>


------------------------------------------------------------------------



Hello,

I am running into a problem with scheduled backups on a Networker 7.2.2
server. By setting NSR_KEEPALIVE_WAIT=300 on the clients, I am able to
run full backups of the systems in question manually, but the same
backups fail when they run on schedule. The common thread for these
systems is the existence of large files (> 1GB), and they are failing
their scheduled backups the same way they did before the keepalive was
added:

--- Unsuccessful Save Sets ---

* ie-1:/yesterday 1 retry attempted


I'll run a verbose save tonight to get more details, but I'm curious
what other next steps might be recommended given that it is only a
problem for scheduled backups.

And yes, I know I need to upgrade - working on that as well. Thanks.

Greg
---
Greg Etling <getling AT stern.nyu DOT edu>
Systems Administrator
Stern IT Enterprise Operations
NYU Stern School of Business

212-998-0746

To sign off this list, send email to listserv AT listserv.temple DOT edu and type "signoff networker" in the body of the email. Please write to networker-request AT listserv.temple DOT edu if you have any problems with this list. You can access the archives at http://listserv.temple.edu/archives/networker.html or via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER




