Networker

Re: [Networker] 7.4 SP5 2 GB chunking of savesets - IMPORTANT

2009-11-06 13:44:36
Subject: Re: [Networker] 7.4 SP5 2 GB chunking of savesets - IMPORTANT
From: Mark Davis <davism AT UWO DOT CA>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date: Fri, 6 Nov 2009 13:39:08 -0500
Nelson and group,

After a lot of thrashing, I was able to get an EMC engineer on the phone. He told me something I haven't seen mentioned in this group. If you have backups being broken up into 2GB chunks, THEY WILL NOT BE RECOVERABLE!

He explained that the first saveset in the chunked backup will recover, but then the rest of the pieces of the backup will not. Clearly a major issue for anyone out there experiencing this problem.

Our environment is a Solaris 10 server/storage node, with 7.4.4 Build 634. The quick fix for me was to go back to an older client. I've just tested this, and using client version 7.4 SP4 Build 634 fixed the issue, at least in our environment.

EMC is saying the problem relates to authentication, and the fix is a patch to the server/storage node, not the client software. Since my server version has been stable for a few months, I am reluctant to go this route right now, especially on a Friday afternoon. I will be following up with EMC on fixing the issue with the server/storage node version.

So if you see <1>,<2>,<3> etc in your daemon.log, you need to fix this now.

fyi

Mark
--
Mark Davis
University of Western Ontario - ITS
email: davism AT uwo DOT ca

Nelson, Allan wrote:
Hi Mark
I had this 2GB chunking problem too, along with another problem to do with the 
creating files in nsr/tmp on Networer 7.4.5...

* xxxx:VSS ASR DISK:\ save: There was an error creating the file: "C:\Program 
Files\Legato\nsr\tmp\sd00001e" errno: 17

I applied patch LGTOpatch20091019 on the clients concerned and that has fixed 
both problems for me.

I can easily make this available to you if you want to try it (I have the 32bit 
and 64 bit Windows versions).
Just drop me an email if you want them.

Cheeers... Allan.

________________________________________
From: EMC NetWorker discussion [NETWORKER AT LISTSERV.TEMPLE DOT EDU] On Behalf 
Of Mark Davis [davism AT UWO DOT CA]
Sent: 06 November 2009 15:01
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Subject: [Networker] 7.4 SP5 2 GB chunking of savesets

I applied this "fix" yesterday, and all the clients with 7.4 SP5 are
still doing the 2GB chunking. I've opened a case with EMC, and the best
I've got so far, is that this is a "feature". They suggested I upgrade
to NW 7.5.1.6 to fix the issue. Unreal.

Note, I am running our server on Solaris 10, so maybe the
NSR_FAIL_ON_AUTH_ERROR   =   YES fix does not apply with Solaris.

I'm currently asking EMC for a full set of 7.4.4 clients, which didn't
exhibit the "feature".

Thanks for the reply.

Mark
--
Mark Davis
University of Western Ontario - ITS
email: davism AT uwo DOT ca

James Pratt wrote:
I had this similar/same problem on 7.5 - Johannes had it right when I
asked the same question here a while back, and EMC confirmed it was
indeed the "fix" -

Btw - thanks Johannes!

jamie


-----Original Message-----
From: EMC NetWorker discussion [mailto:NETWORKER AT LISTSERV.TEMPLE DOT EDU] On
Behalf Of Jóhannes Karl Karlsson
Sent: Monday, October 05, 2009 9:35 PM
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Subject: Re: [Networker] windows 32-bit - nw 7.5.1. - random 2gb ssid chunking

We had this problem in our site when we were running NetWorker 7.4.4 32Bit on
Windows.

An EMC engineer we had working for us on site fixed it by putting this 
environment
variable on the NW server and all the storage nodes.

NSR_FAIL_ON_AUTH_ERROR   =   YES

Strangely, it's not documented on powerlink. But it worked for us.

Johannes



-----Original Message-----
From: EMC NetWorker discussion [mailto:NETWORKER AT LISTSERV.TEMPLE DOT EDU] On
Behalf Of Mark Davis
Sent: Thursday, November 05, 2009 12:02 PM
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Subject: Re: [Networker] 7.4.5 nsrck hangs

I am also seeing this "Continuation Saveset" issue on clients that are
running 7.4 SP5 downloaded last week from EMC's site. I've seen it
happen on a Windows Sever 2008 box, as well as an OS-X box.

 From my daemon.log for the OS-X box:

19:36:32 hostname.uwo.ca:<1>/Data saving to pool '3LTOH1' (300262) 1720 MB
19:39:34 hostname.uwo.ca:<2>/Data saving to pool '3LTOH1' (300262) 1755 MB
19:42:37 hostname.uwo.ca:<3>/Data saving to pool '3LTOH1' (300262) 1792 MB
19:45:43 hostname.uwo.ca:<4>/Data saving to pool '3LTOH1' (300262) 1907 MB
19:48:45 hostname.uwo.ca:<5>/Data saving to pool '3LTOH1' (300262) 1892 MB
19:51:48 hostname.uwo.ca:<6>/Data saving to pool '3LTOH1' (300262) 1973 MB
19:54:50 hostname.uwo.ca:<7>/Data continuing save
19:54:50 hostname.uwo.ca:<8>/Data saving to pool '3LTOH1' (300262) 64 MB
19:57:52 hostname.uwo.ca:<8>/Data continuing save
19:57:52 hostname.uwo.ca:<9>/Data saving to pool '3LTOH1' (300262) 128 MB
20:01:55 hostname.uwo.ca:<10>/Data saving to pool '3LTOH1' (300262) 896 MB
20:05:27 hostname.uwo.ca:<11>/Data saving to pool '3LTOH1' (300262) 1204 MB
20:09:10 hostname.uwo.ca:<12>/Data saving to pool '3LTOH1' (300262) 1677 MB
20:12:43 hostname.uwo.ca:<13>/Data continuing save
20:12:43 hostname.uwo.ca:<14>/Data saving to pool '3LTOH1' (300262) 64 MB;
20:17:15 hostname.uwo.ca:<15>/Data saving to pool '3LTOH1' (300262) 1110 MB
20:21:18 hostname.uwo.ca:<16>/Data saving to pool '3LTOH1' (300262) 1964 MB
20:24:51 hostname.uwo.ca:<18>/Data saving to pool '3LTOH1' (300262) 384 MB;


Any thoughts in the group as to why this is happening?

Thanks,

Mark
--
Mark Davis
University of Western Ontario - ITS
email: davism AT uwo DOT ca

Roberta Gold wrote:
I am using the official release of 7.4.5 downloaded from their website ...


At 12:58 PM -0500 9/8/09, Tim Mooney wrote:
A newly discovered issue (breaking up savesets again):

38706 09/04/09 15:04:22  nsrd server1:/nsr_old continuing save
38718 09/04/09 15:04:22  nsrd server1:<1>/nsr_old saving to pool
'File DD580 STG' (st.1)
Continuation savesets?  That's a blast from the past.

Are you running the released version of 7.4.5 or have you applied any
additional patches?  I ask because when EMC gives out debug binaries,
sometimes those are configured with a ss cutoff size that causes
continuation savesets to be used.

Tim
--
Tim Mooney

To sign off this list, send email to listserv AT listserv.temple DOT edu and type 
"signoff networker" in the body of the email. Please write to networker-request 
AT listserv.temple DOT edu if you have any problems with this list. You can access the 
archives at http://listserv.temple.edu/archives/networker.html or
via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER

To sign off this list, send email to listserv AT listserv.temple DOT edu and type 
"signoff networker" in the body of the email. Please write to networker-request 
AT listserv.temple DOT edu if you have any problems with this list. You can access the 
archives at http://listserv.temple.edu/archives/networker.html or
via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER