Networker

[Networker] Handling Savegroup Completion Notices

2007-07-06 14:44:24
Subject: [Networker] Handling Savegroup Completion Notices
From: nsr admin <nsradmin AT GMAIL DOT COM>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date: Fri, 6 Jul 2007 13:31:27 -0500
We have about 500 savegroups that run each day.  Back in the day when we
only had about 20 of them, having our operations center monitor them was not
a problem.  However now they are about ready to take me out back and flog
me.   I was wondering how others with this many savegroups handle monitoring
for failures, etc.   Do you have people manually check the email notices, or
do you use scripts or a monitoring system to check them?

Here's basically what I'm trying to accomplish.

* Ensure all scheduled savesets are running.  Alert on any that have not ran
in the last 24 hours.  I've ran across a few instances where savegroups in
Networker that will go off in lala land and simply not run.  There is
nothing in the logs, no notifications sent, they simply don't run. Had I not
had our ops center watching this, they would have not been found.  I've not
been able to reproduce this behavior, so opening a case has been
difficult.

* Alert on any failures or aborts

* Alert on any backups that take more than 24 hrs.


If anyone has anything they wouldn't mind sharing, I would appreciate it.

To sign off this list, send email to listserv AT listserv.temple DOT edu and type 
"signoff networker" in the body of the email. Please write to networker-request 
AT listserv.temple DOT edu if you have any problems with this list. You can access the 
archives at http://listserv.temple.edu/archives/networker.html or
via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER

<Prev in Thread] Current Thread [Next in Thread>