Networker

Re: [Networker] problems from upgrade from 7.2.2 to 7.4.3

2008-10-23 12:01:37
Subject: Re: [Networker] problems from upgrade from 7.2.2 to 7.4.3
From: Roberta Gold <gold11 AT LLNL DOT GOV>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date: Thu, 23 Oct 2008 09:01:26 -0700
Wow! Finally someone with the same problems as us!!! Especially item #3, which EMC has been working since our upgrade to 7.3 ... 7.3.1 ... 7.3.2 ... 7.3.3 ... 7.4.1 ... 7.4.2. Apparently it is related to the media index not being ready when it tries to do mounts. Of course the autochanger volumes will retry until successful, but no such retries for advfile volumes ...

Our work-around is to disable both jukeboxes and unmount all advfile volumes before shutting down NetWorker. After restarting NetWorker we wait for "media db open for business' before enabling our devices. Pain in the ...

Anyway, this only works for planned outages. When NetWorker goes down without time to do above, we have to do the workaround after it comes up, and stop/restart again!

I will post case numbers later. I am busy for the next 30 minutes ...

Oh yes. We also experienced 1, 2, 4, & 5 ...




Hey Guys,



Last Thursday I upgrade from 7.2.2 to 7.4.3. It has been less than
smooth so far.



It initial seemed to go flawlessly, but Monday morning nsrd crashed and
would not stay running.  EMC provided a hotfixed nsrd that seems to have
resolved that problem, but I have some other less critical problems that
I was wondering if you guys have seen.



1)      Adv_file devices keep randomly unmounting.  I've seen in the
archives people having issues with RO devices, but in my case it is any
device RW or RO.  There isn't any message any the log about the dismount
just that it notifies me if it needs it mounted.

2)      'Owner notification' either doesn't work, or the functionality
has changed.  My existing scripts don't work with it.  For
troubleshooting, I've made a very simple script that basically takes
stdin and writes it to a file.  That doesn't work either.

3)      Media that is labeled and previously working will not mount.
I'll get a message about "volume xxxxxx(volid xxxxxxxxxxx) NOT in media
index".  But then after awhile it will mount, after no intervention on
my part.  This is happening on tapes within a silo and on my adv_file
type devices that keep unmounting.  May be related to the first problem.

4)      Many, not all, savegroups are not finishing and the jobs that
are just hanging out are typically index saves.

5)      Nsrjb shows empty slots... which in not normal for an acsls
silo.  It allows me to allocate the "volumes" in those slots, but the
volumes are not actually in the silo.  In previous releases, a volume
could not be allocated to a silo unless in was physically in the silo.
I'm assuming this is a bug not a design change.



Any assistance would be appreciated.



FYI... I do have a case open with EMC to address them.



Thanks!



Joel


To sign off this list, send email to listserv AT listserv.temple DOT edu and type "signoff networker" in the body of the email. Please write to networker-request AT listserv.temple DOT edu if you have any problems with this list. You can access the archives at http:// listserv.temple.edu/archives/networker.html or
via RSS at http:// listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER


--
Roberta Gold
Lawrence Livermore National Laboratory
ICC/HPSD - Security Technologies Group
gold11 AT llnl DOT gov
(925) 422-0167

To sign off this list, send email to listserv AT listserv.temple DOT edu and type 
"signoff networker" in the body of the email. Please write to networker-request 
AT listserv.temple DOT edu if you have any problems with this list. You can access the 
archives at http://listserv.temple.edu/archives/networker.html or
via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER

<Prev in Thread] Current Thread [Next in Thread>