Veritas-bu

[Veritas-bu] Jobs not starting...

2002-10-01 10:15:26
Subject: [Veritas-bu] Jobs not starting...
From: MTNiehaus AT MarathonOil DOT com (Niehaus, Michael T.)
Date: Tue, 1 Oct 2002 09:15:26 -0500
This is a multi-part message in MIME format.

------_=_NextPart_001_01C26954.FD015ED0
Content-Type: text/plain;
        charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

Do you have anything in your notify scripts?  We see this sometimes when =
the notify/email scripts generate error dialogs - the jobs just hang =
until someone acknowledges the dialogs, which will never happen.  =
Execute the following command on the server to see the process tree:
=20
tlist -t
=20
This should show you the relationship between the "bpsched" processes =
(one of those at the bottom of the tree should have a process ID that =
corresponds to the process ID of the hung job - with any luck you will =
see a child process as well).  The lowest process ID "bpsched" is =
probably the task doing the actual scheduling.  Restarting the =
"NetBackup Request Manager" service should get it back running again.  =
Still, killing "bpsched" tasks should be a last resort, as it is =
probably just treating the symptom and not the problem.
=20
-Michael

-----Original Message-----
From: Marx, Keath [mailto:kmarx AT trigon DOT com]
Sent: Tuesday, October 01, 2002 8:46 AM
To: veritas-bu AT mailman.eng.auburn DOT edu
Subject: [Veritas-bu] Jobs not starting...



We have NBU 4.5 with 45._1 installed on win2k backing up 2k clients.  =
All of the daily tasks are schedule not frequency based. =20

The last change I made before the jobs started failing was to create a =
task to perform cummulative inc backups of a local directory.  I ran =
that task and since then nothing had run.  I rebooted and my normal =
cummulative tasks that run during the day kicked off.  One ran to 100% =
then hung.  I kicked off some daily tasks and again 1 ran to 100% then =
hung. =20

Task manager shows multiple copies of bpsched running.  If I kill the =
bpsched with the lowest process ID all the rest go away but nothing =
changes.  If I kill the bpsched with the highest process ID it goes away =
but the others stay and nothing happens.  If I kill ALL the bpsched =
nothing new happens.

Not out of disk space on any drive.  An automated Duplicate ran this =
morning without error.  As mentioned nothing was changed on the server.

TIA=20
Keath Marx=20


------_=_NextPart_001_01C26954.FD015ED0
Content-Type: text/html;
        charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META HTTP-EQUIV=3D"Content-Type" CONTENT=3D"text/html; =
charset=3Diso-8859-1">
<TITLE>Jobs not starting...</TITLE>

<META content=3D"MSHTML 5.50.4616.200" name=3DGENERATOR></HEAD>
<BODY>
<DIV><SPAN class=3D330131114-01102002><FONT face=3DArial color=3D#0000ff =
size=3D2>Do you=20
have anything in your notify scripts?&nbsp; We see this sometimes when =
the=20
notify/email scripts generate error dialogs - the jobs just hang until =
someone=20
acknowledges the dialogs, which will never happen.&nbsp; Execute the =
following=20
command on the server to see the process tree:</FONT></SPAN></DIV>
<DIV><SPAN class=3D330131114-01102002><FONT face=3DArial color=3D#0000ff =

size=3D2></FONT></SPAN>&nbsp;</DIV>
<DIV><SPAN class=3D330131114-01102002><FONT face=3DArial color=3D#0000ff =
size=3D2>tlist=20
-t</FONT></SPAN></DIV>
<DIV><SPAN class=3D330131114-01102002><FONT face=3DArial color=3D#0000ff =

size=3D2></FONT></SPAN>&nbsp;</DIV>
<DIV><SPAN class=3D330131114-01102002><FONT face=3DArial color=3D#0000ff =
size=3D2>This=20
should show you the relationship between the "bpsched" processes (one of =
those=20
at the bottom of the tree should have a process ID that corresponds to =
the=20
process ID of the hung job - with any luck you will see a child process =
as=20
well).&nbsp; The lowest process ID "bpsched" is probably the task doing =
the=20
actual scheduling.&nbsp; Restarting the "NetBackup Request Manager" =
service=20
should get it back running again.&nbsp; Still, killing "bpsched" tasks =
should be=20
a last resort, as it is probably just treating the symptom and not the=20
problem.</FONT></SPAN></DIV>
<DIV><SPAN class=3D330131114-01102002><FONT face=3DArial color=3D#0000ff =

size=3D2></FONT></SPAN>&nbsp;</DIV>
<DIV><SPAN class=3D330131114-01102002><FONT face=3DArial color=3D#0000ff =

size=3D2>-Michael</FONT></SPAN></DIV>
<BLOCKQUOTE dir=3Dltr style=3D"MARGIN-RIGHT: 0px">
  <DIV class=3DOutlookMessageHeader dir=3Dltr align=3Dleft><FONT =
face=3DTahoma=20
  size=3D2>-----Original Message-----<BR><B>From:</B> Marx, Keath=20
  [mailto:kmarx AT trigon DOT com]<BR><B>Sent:</B> Tuesday, October 01, 2002 =
8:46=20
  AM<BR><B>To:</B> veritas-bu AT mailman.eng.auburn DOT 
edu<BR><B>Subject:</B>=20
  [Veritas-bu] Jobs not starting...<BR><BR></FONT></DIV>
  <P><FONT face=3DArial size=3D2>We have NBU 4.5 with 45._1 installed on =
win2k=20
  backing up 2k clients.&nbsp; All of the daily tasks are schedule not =
frequency=20
  based.&nbsp; </FONT></P>
  <P><FONT face=3DArial size=3D2>The last change I made before the jobs =
started=20
  failing was to create a task to perform cummulative inc backups of a =
local=20
  directory.&nbsp; I ran that task and since then nothing had run.&nbsp; =
I=20
  rebooted and my normal cummulative tasks that run during the day =
kicked=20
  off.&nbsp; One ran to 100% then hung.&nbsp; I kicked off some daily =
tasks and=20
  again 1 ran to 100% then hung.&nbsp; </FONT></P>
  <P><FONT face=3DArial size=3D2>Task manager shows multiple copies of =
bpsched=20
  running.&nbsp; If I kill the bpsched with the lowest process ID all =
the rest=20
  go away but nothing changes.&nbsp; If I kill the bpsched with the =
highest=20
  process ID it goes away but the others stay and nothing happens.&nbsp; =
If I=20
  kill ALL the bpsched nothing new happens.</FONT></P>
  <P><FONT face=3DArial size=3D2>Not out of disk space on any =
drive.&nbsp; An=20
  automated Duplicate ran this morning without error.&nbsp; As mentioned =
nothing=20
  was changed on the server.</FONT></P>
  <P><FONT face=3DArial size=3D2>TIA</FONT> <BR><FONT face=3DArial =
size=3D2>Keath=20
  Marx</FONT> </P></BLOCKQUOTE></BODY></HTML>

------_=_NextPart_001_01C26954.FD015ED0--

<Prev in Thread] Current Thread [Next in Thread>