Re: [Networker] v7.4.2 to v7.4.4
2009-03-03 10:37:22
Hello,
Just to confirm, we have jobsdb issues as well. We are on 7.3.3 and
typically see jobsdb errors, hung index backups, etc. Rebooting and
cleaning up temp and jobsdb helps.
I hope EMC puts out fixes for it quickly; as David pointed out,
frequent reboots are not an option in many environments.
Could a rearchitecture of Networker help? I.e., the master server alone
holds the index, media DB and nsr files, which can either be replicated
to the storage nodes (SNs), or the SNs can continue to write to a local
DB and sync up when the master server is available again after a reboot.
This would mean a minimum requirement of two Networker servers for every
environment, and also the ability to recover one or the other server's DB
(as with clustered SQL). Could this be conceivable? Other vendors already
do such clustering.
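To make the "write locally, sync up later" idea concrete, here is a minimal sketch in Python. It is not how Networker works today and uses no Networker API: MasterDB, StorageNode, record_save and sync are all hypothetical names invented for illustration. It assumes a storage node can tell when a write to the master fails and can replay its local journal in order afterwards.

```python
from dataclasses import dataclass, field


@dataclass
class MasterDB:
    """Hypothetical stand-in for the master server's index/media DB."""
    available: bool = True
    records: list = field(default_factory=list)

    def write(self, record):
        if not self.available:
            raise ConnectionError("master server unavailable")
        self.records.append(record)


@dataclass
class StorageNode:
    """Hypothetical storage node that journals locally while the master is down."""
    master: MasterDB
    journal: list = field(default_factory=list)

    def sync(self):
        # Replay journaled records oldest first; an error leaves the rest queued.
        while self.journal:
            self.master.write(self.journal[0])
            self.journal.pop(0)

    def record_save(self, record):
        # Flush any backlog first so records reach the master in order.
        try:
            self.sync()
            self.master.write(record)
        except ConnectionError:
            self.journal.append(record)


master = MasterDB()
node = StorageNode(master)

node.record_save("client-a saveset 1")
master.available = False            # master server is rebooting
node.record_save("client-b saveset 2")
node.record_save("client-c saveset 3")
master.available = True             # master server is back
node.sync()
```

The journal is replayed before any new record is sent, so the master sees records in the order the storage node produced them. A real design would also need a durable journal and conflict handling, which this sketch leaves out.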
Thanks
"Browning, David" <DBrown AT LSUHSC DOT EDU>
Sent by: EMC NetWorker discussion <NETWORKER AT LISTSERV.TEMPLE DOT EDU>
03/03/2009 08:57 AM
Please respond to: EMC NetWorker discussion <NETWORKER AT LISTSERV.TEMPLE DOT EDU>;
"Browning, David" <DBrown AT LSUHSC DOT EDU>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
cc:
Subject: Re: v7.4.2 to v7.4.4
We are experiencing similar problems with jobs hanging in 7.4.4 (in
7.2.2 they ran without issue). Legato seems clueless.
Like you, I've done all I can with parameter changes, jobd size change,
etc.
For now, we do a daily, or every other day, reboot. We are fortunate
enough to have available time during the daytime hours to do a reboot.
I'd hate to see what would happen if we didn't have that luxury.
Anybody using 7.5?
David M. Browning Jr.
IT Project Coordinator Enterprise Backups and Help Desk
-----Original Message-----
From: EMC NetWorker discussion [mailto:NETWORKER AT LISTSERV.TEMPLE DOT EDU] On
Behalf Of soupdragon
Sent: Saturday, February 28, 2009 4:21 PM
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Subject: [Networker] v7.4.2 to v7.4.4
We recently upgraded our heavily loaded Solaris 8 servers with 800 or so
clients from 7.2.2 to 7.4.3 - then 7.4.4 in order to fix some bugs.
Upgrade fairly straightforward - in fact 7.4.3 to 7.4.4 took a matter of
minutes - just had to ensure our /usr/kernel/lus.conf modifications were
reapplied.
Mixed feelings about the new release - certainly prettier, but it seems to
be much less robust than 7.2.2 and needs constant babysitting. I believe
this is to do with the jobd functionality - we have applied all
recommended TCP parameter changes, increased client parallelism to the max
for the server's client resource and increased the size of the jobsdb
database as recommended elsewhere.
Savegroups still hang at 99% complete yet succeed on rerun. This gets
worse over time until the Networker service is restarted - we find we
have to do this once a fortnight at least. As it restarts we see a
number of daemon messages about save sessions being marked as
complete in jobdb - this seems to fix the issues for a while. But it
does mean we cannot leave Networker to run unattended anymore.
Other annoyances are:
- The nsrjb -L -R option, which is supposed to allow silent relabelling of
  recyclable tapes, no longer works.
- Certain client-initiated saves do not appear in the NMC sessions window,
  yet nsrwatch shows them (may be to do with the lack of a group in the
  save command - need to check).
- All indexes are re-saved on a rerun, even if only 1 client failed.
- The new group restart interval (we set it to 23:59) precludes rerunning a
  group that happens to exceed its 24 hr window - this was never a
  problem on 7.2.
- The nwrecover GUI no longer shows the target for symbolic links - it used
  to, and command line recover still does.
Good things:
- Automatic inventory of tapes known to the media database when deposited.
- Ability to sort columns in the GUI, and reporting of session data rates
  (very useful).
- Operations on multiple volumes in the GUI almost obviate the need for
  custom scripts to label tapes etc.
+----------------------------------------------------------------------
|This was sent by julian.barnett AT standardchartered DOT com via Backup
Central.
|Forward SPAM to abuse AT backupcentral DOT com.
+----------------------------------------------------------------------
To sign off this list, send email to listserv AT listserv.temple DOT edu and
type "signoff networker" in the body of the email. Please write to
networker-request AT listserv.temple DOT edu if you have any problems with this
list. You can access the archives at
http://listserv.temple.edu/archives/networker.html or
via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER