Networker

Re: [Networker] Group still running, but data stream has stopped and volume unmounted

2003-07-31 18:52:20
Subject: Re: [Networker] Group still running, but data stream has stopped and volume unmounted
From: "Fowler, Carter" <CFowler AT LONDON DOT CA>
To: NETWORKER AT LISTMAIL.TEMPLE DOT EDU
Date: Thu, 31 Jul 2003 18:52:12 -0400
Hi Jim;

We've experienced this same issue on our Win2K / NW server.  If you
cross reference the daemon.log with the savegrp.log you should notice
that the save set actually completed hours prior to the timeout
condition being met.  That's what we used to encounter.  Our solution
was to update from 6.1.1 to 6.1.3.  This seemed to resolve the issue in
our case.  The logs are the important part of the trouble shooting
though.  It appears that NW has a problem in keeping track of what it's
doing.  :)

Good luck,

Carter
XT. 5711
cfowler AT london DOT ca
 
-----Original Message-----
From: Neild, Jim [mailto:Jim.Neild AT SSHA.ON DOT CA] 
Sent: Thursday, July 31, 2003 1:40 PM
To: NETWORKER AT LISTMAIL.TEMPLE DOT EDU
Subject: Re: [Networker] Group still running, but data stream has
stopped and volume unmounted

In activity timeout is at 30 minutes.  I have multiple clients defined
and several of them were still awaiting backup.  Finally paralellism is
set to 4, so even if one client was slow, should the other clients keep
the group busy and stop it from timing out?

None of the clients have really big volumes (i.e 100GB max and only 7GB
used).  Network is Gb on fast server with SAN attached disk.

Anything else?

Jim Neild
Technical Analyst
Smart Systems for Health Agency (SSHA)


-----Original Message-----
From: Faidherbe, Thierry [mailto:thierry.faidherbe AT hp DOT com] 
Sent: Thursday, July 31, 2003 1:36 PM
To: Legato NetWorker discussion; Neild, Jim
Subject: RE: [Networker] Group still running, but data stream has
stopped and volume unmounted


Look if the occurrence is matching with incr/differential backup
schedules on big volumes : I experienced more than once that
incr/differential backups of big volumes to cause inactivity time-outs
and backup to be aborted, networker not receiving data fast enough
because save was walking down the big volume retrieving modified files
or because disks were I/O saturated (caused by Anti-Virus Scan).

The same trouble can also be caused by big files from {slow} client,
networker not getting CFI updates fast enough in the time range and then
times-out.

You can give a try to increase group inactivity to avoid that problem.

HTH,

Thierry

Kind regards - Bien cordialement - Vriendelijke groeten,

Thierry FAIDHERBE

HPCI - Storage & Server Integration Practice 
Tru64 Unix and Legato EBS Consultant
                                   
 *********       *********   HEWLETT - PACKARD
 *******    h      *******   1 Rue de l'aeronef/Luchtschipstraat
 ******    h        ******   1140 Bruxelles/Brussel/Brussels
 *****    hhhh  pppp *****   
 *****   h  h  p  p  *****   100/102 Blv de la Woluwe/Woluwedal
 *****  h  h  pppp   *****   1200 Bruxelles/Brussel/Brussels
 ******      p      ******   BELGIUM
 *******    p      *******                              
 *********       *********   Phone :    +32 (0)2  / 729.85.42   
                             Mobile :   +32 (0)498/  94.60.85 
                             Fax :      +32 (0)2  / 729.88.30   
     I  N  V  E  N  T        Email :    thierry.faidherbe AT hp DOT com
                             Internet : http://www.hp.com/
________________________________________________________________________

MOBISTAR SA/NV 

SYSTEM Team Charleroi, Mermoz 2 Phone : +32 (0)2  / 745.75.81  
Avenue Jean Mermoz, 32          Fax :   +32 (0)2  / 745.89.56  
6041 GOSSELIES                  Email : tfhaidhe AT mail.mobistar DOT be
BELGIUM                         Web :   http://www.mobistar.be/
________________________________________________________________________

  


-----Original Message-----
From: Neild, Jim [mailto:Jim.Neild AT SSHA.ON DOT CA] 
Sent: Thursday, July 31, 2003 7:26 PM
To: NETWORKER AT LISTMAIL.TEMPLE DOT EDU
Subject: Re: [Networker] Group still running, but data stream has
stopped and volume unmounted


What is the cause though?  It happens from time to time and it really
ticks me off.  I have multiple "incomplete savesets" listed under
details.  What is the best way to troubleshoot this type of problem?

Cheers,
Jim

Jim Neild
Technical Analyst
Smart Systems for Health Agency (SSHA)


-----Original Message-----
From: Librado Pamintuan [mailto:LPAMINTU AT REGINA DOT CA] 
Sent: Thursday, July 31, 2003 1:14 PM
To: NETWORKER AT LISTMAIL.TEMPLE DOT EDU
Subject: Re: [Networker] Group still running, but data stream has
stopped and volume unmounted


look for the 'save' process on the client side and kill it (Unix) or
stop it (Windows). I had this issue lots of time.

Librado


>>> maarten.boot AT NL.COMPUWARE DOT COM 07/31/03 10:31am >>>
On the networker GUI go to Group Control
if there is a group still running check it with the Details option is
there a client still pending?

if it is not critical you can stop the group there with the stop button
(my gui is on unix)

Maarten

On Thursday 31 July 2003 18:26, you wrote:
> This must be a common issue.  I have a group that I kicked off and it
is
> running and backing up tickity-boo.  Then data stops streaming to
tape,
> group is still running, eventually the volume is unmounted due to
lack
> of inactivity, yet the group is still "running"
>
> Any ideas?
>
> Cheers,
> Jim
>
> --
> Note: To sign off this list, send a "signoff networker" command via
email
> to listserv AT listmail.temple DOT edu or visit the list's Web site at
> http://listmail.temple.edu/archives/networker.html where you can also 
> view and post messages to the list. 
> =*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=

--
Note: To sign off this list, send a "signoff networker" command via
email to listserv AT listmail.temple DOT edu or visit the list's Web site at
http://listmail.temple.edu/archives/networker.html where you can also
view and post messages to the list.
=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=


--
Note: To sign off this list, send a "signoff networker" command via
email to listserv AT listmail.temple DOT edu or visit the list's Web site at
http://listmail.temple.edu/archives/networker.html where you can also
view and post messages to the list.
=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=

--
Note: To sign off this list, send a "signoff networker" command via
email to listserv AT listmail.temple DOT edu or visit the list's Web site at
http://listmail.temple.edu/archives/networker.html where you can also
view and post messages to the list.
=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=

--
Note: To sign off this list, send a "signoff networker" command via
email
to listserv AT listmail.temple DOT edu or visit the list's Web site at
http://listmail.temple.edu/archives/networker.html where you can
also view and post messages to the list.
=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=

--
Note: To sign off this list, send a "signoff networker" command via email
to listserv AT listmail.temple DOT edu or visit the list's Web site at
http://listmail.temple.edu/archives/networker.html where you can
also view and post messages to the list.
=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=