Re: [Networker] Hung savegroup problem

2010-05-26 09:46:52
Subject: Re: [Networker] Hung savegroup problem
From: "Clark, Patti" <clarkp AT OSTI DOT GOV>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date: Wed, 26 May 2010 09:45:01 -0400
We run v7.4.3 on RHEL4 here and see the same issue on two separate systems.
The hangs became more frequent after I'd made recent config changes via NMC,
e.g. modifying settings or adding clients.  As suggested in other postings, I
also found that not running NMC on the backup server reduced the occurrences.

Patti Clark
Sr Linux System Admin
DOE/OSTI 

> -----Original Message-----
> From: EMC NetWorker discussion 
> [mailto:NETWORKER AT LISTSERV.TEMPLE DOT EDU] On Behalf Of Stephanie Finnegan
> Sent: Wednesday, May 26, 2010 9:22 AM
> To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
> Subject: Re: [Networker] Hung savegroup problem
> 
> We're also running 7.4.4.4 and have the same issue, and we do
> exactly what Paul stated.  When we start to see little weird
> things here and there and realize several weeks have gone by,
> we know it's time to clear out tmp, rename jobsdb and restart
> the clock.
> 
> It's frustrating, because it's treating the symptoms and not 
> addressing the cause, but we realized we weren't going to get 
> a treatment for the cause and this is a minor inconvenience 
> that buys us a month or more of stability.  
> 
> -----Original Message-----
> From: EMC NetWorker discussion 
> [mailto:NETWORKER AT LISTSERV.TEMPLE DOT EDU] On Behalf Of Goslin, Paul
> Sent: Wednesday, May 26, 2010 7:48 AM
> To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
> Subject: Re: [Networker] Hung savegroup problem
> 
> We are also running 7.4.4 and experience this once in a great while,
> but not often enough that it becomes a problem. What we see every 6-8
> weeks or so is that when our daily 'index' group runs, which backs up
> all our client indexes, I start it with 7 concurrent sessions but it
> only runs one stream at a time instead of 7. That's when I know it's
> time to refresh our environment (stop and restart the server
> processes). I also remove \nsr\tmp and rename \nsr\res\jobsdb\ to
> jobsdb.old while the processes are stopped. They'll get re-created
> when you restart the Networker service. After restarting, things are
> fine for another 6 to 8 weeks. This was once suggested by someone at
> EMC support to fix a group hanging problem. It seems to help with
> other symptoms where things hang for no apparent reason or begin to
> misbehave.
> 
> -----Original Message-----
> From: EMC NetWorker discussion 
> [mailto:NETWORKER AT LISTSERV.TEMPLE DOT EDU] On
> Behalf Of STANLEY R. HORWITZ
> Sent: Wednesday, May 26, 2010 2:32 AM
> To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
> Subject: [Networker] Hung savegroup problem
> 
> Every once in a while, a savegroup on my Linux NetWorker 7.4.4 server
> hangs. It always seems to hang on one client's backup, but it never
> seems to be the same client. For example, tonight I happened to notice
> that a savegroup hung on a backup of client x. I killed that savegroup
> and then I restarted it. The same savegroup hung on a client y. No
> other savegroups were running at this time. In both cases, there was
> no discernible backup activity. All the clients in this particular
> group are Linux file servers, except for one that is Windows 2003.
> There are 36 clients in this group.
>
> This is a fairly simple setup with a Qualstar tape library with four
> LTO-3 tape drives connected via fiber to the Red Hat AS 4.5 Linux
> backup server.
>
> I actually had a case open with EMC about this issue and their only
> response was to upgrade to 7.5 on the clients if it happens again.
> Would that really help? I am wondering what other people are doing to
> address this problem if it still is present in later NetWorker
> versions.
> 
> To sign off this list, send email to listserv AT listserv.temple DOT edu and
> type "signoff networker" in the body of the email. Please write to
> networker-request AT listserv.temple DOT edu if you have any problems with
> this list. You can access the archives at
> http://listserv.temple.edu/archives/networker.html or
> via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER
> 
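
For anyone who wants to script the file-system part of the workaround Paul
describes in the quoted message (remove nsr/tmp, rename nsr/res/jobsdb to
jobsdb.old while the server processes are stopped), here is a rough Python
sketch. It is only an illustration, not anything from EMC: the function name
refresh_nsr_state, the dry_run flag, and the choice to discard an older
jobsdb.old before renaming are all my own assumptions. It deliberately does not
stop or start NetWorker; do that yourself, the way you normally would on your
platform, before and after running it.

```python
import shutil
import tempfile
from pathlib import Path


def refresh_nsr_state(nsr_root, dry_run=True):
    """Remove <nsr_root>/tmp and rename <nsr_root>/res/jobsdb to jobsdb.old.

    Hypothetical helper: run it only while the NetWorker server processes
    are stopped. Returns the list of actions taken (or planned, if dry_run).
    """
    root = Path(nsr_root)
    tmp_dir = root / "tmp"
    jobsdb = root / "res" / "jobsdb"
    jobsdb_old = root / "res" / "jobsdb.old"
    actions = []

    if tmp_dir.is_dir():
        actions.append(f"remove {tmp_dir}")
        if not dry_run:
            shutil.rmtree(tmp_dir)

    if jobsdb.is_dir():
        # Assumption: a jobsdb.old left over from an earlier refresh can go,
        # so the rename below cannot collide with it.
        if jobsdb_old.is_dir():
            actions.append(f"remove {jobsdb_old}")
            if not dry_run:
                shutil.rmtree(jobsdb_old)
        actions.append(f"rename {jobsdb} -> {jobsdb_old}")
        if not dry_run:
            jobsdb.rename(jobsdb_old)

    return actions


if __name__ == "__main__":
    # Demonstrate on a throwaway directory tree, never on a real nsr directory.
    with tempfile.TemporaryDirectory() as scratch:
        fake_nsr = Path(scratch) / "nsr"
        (fake_nsr / "tmp").mkdir(parents=True)
        (fake_nsr / "res" / "jobsdb").mkdir(parents=True)
        for line in refresh_nsr_state(fake_nsr, dry_run=False):
            print(line)
```

With dry_run left at its default of True the function only reports what it
would do, which is a reasonable first run against a real server's nsr
directory.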
