Networker

Re: [Networker] Group still running, but data stream has stopped and volume unmounted

2003-08-01 07:13:41
Subject: Re: [Networker] Group still running, but data stream has stopped and volume unmounted
From: Conrad Macina <conrad.macina AT PFIZER DOT COM>
To: NETWORKER AT LISTMAIL.TEMPLE DOT EDU
Date: Fri, 1 Aug 2003 07:03:38 -0400
We have the same problem with 6.1.3 under Solaris 8. I agree that it
probably is volume-related, as it occurs most frequently over the weekend,
when we do our full backups. In most cases, the save group and save
commands are no longer running, so I assume it's a GUI issue. Our SOP is to
wait for idle time on the storage node and then stop/start NetWorker on the
affected storage node only. That forces the master server to clean up its
act and get the GUI back in line with reality.





On Thu, 31 Jul 2003 18:52:12 -0400, Fowler, Carter <CFowler AT LONDON DOT CA>
wrote:

>Hi Jim;
>
>We've experienced this same issue on our Win2K / NW server.  If you
>cross reference the daemon.log with the savegrp.log you should notice
>that the save set actually completed hours prior to the timeout
>condition being met.  That's what we used to encounter.  Our solution
>was to update from 6.1.1 to 6.1.3.  This seemed to resolve the issue in
>our case.  The logs are the important part of the trouble shooting
>though.  It appears that NW has a problem in keeping track of what it's
>doing.  :)
>
>Good luck,
>
>Carter
>XT. 5711
>cfowler AT london DOT ca
>
>-----Original Message-----
>From: Neild, Jim [mailto:Jim.Neild AT SSHA.ON DOT CA]
>Sent: Thursday, July 31, 2003 1:40 PM
>To: NETWORKER AT LISTMAIL.TEMPLE DOT EDU
>Subject: Re: [Networker] Group still running, but data stream has
>stopped and volume unmounted
>
>In activity timeout is at 30 minutes.  I have multiple clients defined
>and several of them were still awaiting backup.  Finally paralellism is
>set to 4, so even if one client was slow, should the other clients keep
>the group busy and stop it from timing out?
>
>None of the clients have really big volumes (i.e 100GB max and only 7GB
>used).  Network is Gb on fast server with SAN attached disk.
>
>Anything else?
>
>Jim Neild
>Technical Analyst
>Smart Systems for Health Agency (SSHA)
>
>
>-----Original Message-----
>From: Faidherbe, Thierry [mailto:thierry.faidherbe AT hp DOT com]
>Sent: Thursday, July 31, 2003 1:36 PM
>To: Legato NetWorker discussion; Neild, Jim
>Subject: RE: [Networker] Group still running, but data stream has
>stopped and volume unmounted
>
>
>Look if the occurrence is matching with incr/differential backup
>schedules on big volumes : I experienced more than once that
>incr/differential backups of big volumes to cause inactivity time-outs
>and backup to be aborted, networker not receiving data fast enough
>because save was walking down the big volume retrieving modified files
>or because disks were I/O saturated (caused by Anti-Virus Scan).
>
>The same trouble can also be caused by big files from {slow} client,
>networker not getting CFI updates fast enough in the time range and then
>times-out.
>
>You can give a try to increase group inactivity to avoid that problem.
>
>HTH,
>
>Thierry
>
>Kind regards - Bien cordialement - Vriendelijke groeten,
>
>Thierry FAIDHERBE
>
>HPCI - Storage & Server Integration Practice
>Tru64 Unix and Legato EBS Consultant
>
> *********       *********   HEWLETT - PACKARD
> *******    h      *******   1 Rue de l'aeronef/Luchtschipstraat
> ******    h        ******   1140 Bruxelles/Brussel/Brussels
> *****    hhhh  pppp *****
> *****   h  h  p  p  *****   100/102 Blv de la Woluwe/Woluwedal
> *****  h  h  pppp   *****   1200 Bruxelles/Brussel/Brussels
> ******      p      ******   BELGIUM
> *******    p      *******
> *********       *********   Phone :    +32 (0)2  / 729.85.42
>                             Mobile :   +32 (0)498/  94.60.85
>                             Fax :      +32 (0)2  / 729.88.30
>     I  N  V  E  N  T        Email :    thierry.faidherbe AT hp DOT com
>                             Internet : http://www.hp.com/
>________________________________________________________________________
>
>MOBISTAR SA/NV
>
>SYSTEM Team Charleroi, Mermoz 2 Phone : +32 (0)2  / 745.75.81
>Avenue Jean Mermoz, 32          Fax :   +32 (0)2  / 745.89.56
>6041 GOSSELIES                  Email : tfhaidhe AT mail.mobistar DOT be
>BELGIUM                         Web :   http://www.mobistar.be/
>________________________________________________________________________
>
>
>
>
>-----Original Message-----
>From: Neild, Jim [mailto:Jim.Neild AT SSHA.ON DOT CA]
>Sent: Thursday, July 31, 2003 7:26 PM
>To: NETWORKER AT LISTMAIL.TEMPLE DOT EDU
>Subject: Re: [Networker] Group still running, but data stream has
>stopped and volume unmounted
>
>
>What is the cause though?  It happens from time to time and it really
>ticks me off.  I have multiple "incomplete savesets" listed under
>details.  What is the best way to troubleshoot this type of problem?
>
>Cheers,
>Jim
>
>Jim Neild
>Technical Analyst
>Smart Systems for Health Agency (SSHA)
>
>
>-----Original Message-----
>From: Librado Pamintuan [mailto:LPAMINTU AT REGINA DOT CA]
>Sent: Thursday, July 31, 2003 1:14 PM
>To: NETWORKER AT LISTMAIL.TEMPLE DOT EDU
>Subject: Re: [Networker] Group still running, but data stream has
>stopped and volume unmounted
>
>
>look for the 'save' process on the client side and kill it (Unix) or
>stop it (Windows). I had this issue lots of time.
>
>Librado
>
>
>>>> maarten.boot AT NL.COMPUWARE DOT COM 07/31/03 10:31am >>>
>On the networker GUI go to Group Control
>if there is a group still running check it with the Details option is
>there a client still pending?
>
>if it is not critical you can stop the group there with the stop button
>(my gui is on unix)
>
>Maarten
>
>On Thursday 31 July 2003 18:26, you wrote:
>> This must be a common issue.  I have a group that I kicked off and it
>is
>> running and backing up tickity-boo.  Then data stops streaming to
>tape,
>> group is still running, eventually the volume is unmounted due to
>lack
>> of inactivity, yet the group is still "running"
>>
>> Any ideas?
>>
>> Cheers,
>> Jim
>>
>> --
>> Note: To sign off this list, send a "signoff networker" command via
>email
>> to listserv AT listmail.temple DOT edu or visit the list's Web site at
>> http://listmail.temple.edu/archives/networker.html where you can also
>> view and post messages to the list.
>> =*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=
>
>--
>Note: To sign off this list, send a "signoff networker" command via
>email to listserv AT listmail.temple DOT edu or visit the list's Web site at
>http://listmail.temple.edu/archives/networker.html where you can also
>view and post messages to the list.
>=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=
>
>
>--
>Note: To sign off this list, send a "signoff networker" command via
>email to listserv AT listmail.temple DOT edu or visit the list's Web site at
>http://listmail.temple.edu/archives/networker.html where you can also
>view and post messages to the list.
>=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=
>
>--
>Note: To sign off this list, send a "signoff networker" command via
>email to listserv AT listmail.temple DOT edu or visit the list's Web site at
>http://listmail.temple.edu/archives/networker.html where you can also
>view and post messages to the list.
>=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=
>
>--
>Note: To sign off this list, send a "signoff networker" command via
>email
>to listserv AT listmail.temple DOT edu or visit the list's Web site at
>http://listmail.temple.edu/archives/networker.html where you can
>also view and post messages to the list.
>=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=
>
>--
>Note: To sign off this list, send a "signoff networker" command via email
>to listserv AT listmail.temple DOT edu or visit the list's Web site at
>http://listmail.temple.edu/archives/networker.html where you can
>also view and post messages to the list.
>=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=

--
Note: To sign off this list, send a "signoff networker" command via email
to listserv AT listmail.temple DOT edu or visit the list's Web site at
http://listmail.temple.edu/archives/networker.html where you can
also view and post messages to the list.
=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=

<Prev in Thread] Current Thread [Next in Thread>