Veritas-bu

[Veritas-bu] Can't kill an "Active" job...

2005-12-07 16:12:17
Subject: [Veritas-bu] Can't kill an "Active" job...
From: Gregory.Geyer AT Avnet DOT com (Geyer, Gregory)
Date: Wed, 7 Dec 2005 14:12:17 -0700
This is a multi-part message in MIME format.

------_=_NextPart_001_01C5FB72.E70FFA7A
Content-Type: text/plain;
        charset="us-ascii"
Content-Transfer-Encoding: quoted-printable

You are not alone.  And sometimes they persist after a reboot (which
means I move the /usr/openv/netbackup/db/jobs/bpjobd.act.db to a backup
name prior to restarting NBU).
=20
I used to have success killing the child bpsched PID (listed as "Active
PID") but the last couple of times I tried that, it caused the primary
bpsched process to restart....dropping all active backups.  So I don't
do that anymore and just wait for a bounce and tell ops to ignore the
phantom jobs.
=20
I don't know but I suspect if bpsched gets a breather it might clean
this up, but my bpsched never gets a breather so I don't know.
=20
G.

________________________________

From: veritas-bu-admin AT mailman.eng.auburn DOT edu
[mailto:veritas-bu-admin AT mailman.eng.auburn DOT edu] On Behalf Of Aaron
Mills
Sent: Wednesday, December 07, 2005 1:58 PM
To: veritas-bu AT mailman.eng.auburn DOT edu
Subject: [Veritas-bu] Can't kill an "Active" job...



I have several jobs that are stuck in "active" mode but not doing
anything.=20

JobID   Type  State Status          Policy        Schedule
Client          Dest Media Svr Active PID

 9716 Backup Active                inbound           ftpif
foo.com       foo.com         5376=20
 9018 Backup Active                inbound           ftpif
foo.com       foo.com         19580=20
 9845 Backup Queued              DBArchive   Oracle-Policy
bar.com=20
 9844 Backup Queued              DBArchive   Oracle-Policy      bar.com=20

I tried to kill the job with:=20

bpdbjobs -cancel 9716=20

to no avail. The FAQ-O-Matic says I need to restart NBU and manually
delete the db files? Is this accurate or is there an easier way to clean
these buggers up?

Thanks.=20

Aaron Mills=20
System Administrator=20
Return Path, Inc.=20
303.642.4111=20
aaron.mills AT returnpath DOT net=20
http://www.returnpath.biz <http://www.returnpath.biz> =20



------_=_NextPart_001_01C5FB72.E70FFA7A
Content-Type: text/html;
        charset="us-ascii"
Content-Transfer-Encoding: quoted-printable

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD><TITLE>Can't kill an "Active" job...</TITLE>
<META http-equiv=3DContent-Type content=3D"text/html; =
charset=3Dus-ascii">
<META content=3D"MSHTML 6.00.2900.2769" name=3DGENERATOR></HEAD>
<BODY>
<DIV dir=3Dltr align=3Dleft><SPAN class=3D166190821-07122005><FONT =
face=3D"Courier New"=20
color=3D#0000ff size=3D2>You are not alone.&nbsp; And sometimes they =
persist after a=20
reboot (which means I move the =
/usr/openv/netbackup/db/jobs/bpjobd.act.db to a=20
backup name prior to restarting NBU).</FONT></SPAN></DIV>
<DIV dir=3Dltr align=3Dleft><SPAN class=3D166190821-07122005><FONT =
face=3D"Courier New"=20
color=3D#0000ff size=3D2></FONT></SPAN>&nbsp;</DIV>
<DIV dir=3Dltr align=3Dleft><SPAN class=3D166190821-07122005><FONT =
face=3D"Courier New"=20
color=3D#0000ff size=3D2>I used to have success killing the child =
bpsched PID=20
(listed as "Active PID") but the last couple of times I tried that, it =
caused=20
the primary bpsched process to restart....dropping all active =
backups.&nbsp; So=20
I don't do that anymore and just wait for a bounce and tell ops to =
ignore the=20
phantom jobs.</FONT></SPAN></DIV>
<DIV dir=3Dltr align=3Dleft><SPAN class=3D166190821-07122005><FONT =
face=3D"Courier New"=20
color=3D#0000ff size=3D2></FONT></SPAN>&nbsp;</DIV>
<DIV dir=3Dltr align=3Dleft><SPAN class=3D166190821-07122005><FONT =
face=3D"Courier New"=20
color=3D#0000ff size=3D2>I don't know but I suspect if bpsched gets a =
breather it=20
might clean this up, but my bpsched never gets a breather so I don't=20
know.</FONT></SPAN></DIV>
<DIV dir=3Dltr align=3Dleft><SPAN class=3D166190821-07122005><FONT =
face=3D"Courier New"=20
color=3D#0000ff size=3D2></FONT></SPAN>&nbsp;</DIV>
<DIV dir=3Dltr align=3Dleft><SPAN class=3D166190821-07122005><FONT =
face=3D"Courier New"=20
color=3D#0000ff size=3D2>G.</FONT></SPAN></DIV><BR>
<DIV class=3DOutlookMessageHeader lang=3Den-us dir=3Dltr align=3Dleft>
<HR tabIndex=3D-1>
<FONT face=3DTahoma size=3D2><B>From:</B> =
veritas-bu-admin AT mailman.eng.auburn DOT edu=20
[mailto:veritas-bu-admin AT mailman.eng.auburn DOT edu] <B>On Behalf Of =
</B>Aaron=20
Mills<BR><B>Sent:</B> Wednesday, December 07, 2005 1:58 PM<BR><B>To:</B> =

veritas-bu AT mailman.eng.auburn DOT edu<BR><B>Subject:</B> [Veritas-bu] Can't =
kill an=20
"Active" job...<BR></FONT><BR></DIV>
<DIV></DIV><!-- Converted from text/rtf format -->
<P><FONT face=3DArial size=3D2>I have several jobs that are stuck in =
"active" mode=20
but not doing anything. </FONT></P>
<P><FONT face=3DArial size=3D2>JobID&nbsp;&nbsp; Type&nbsp; State=20
Status&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;=20
Policy&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;=20
Schedule&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp=
;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;=
&nbsp;=20
Client&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Dest Media =
Svr=20
Active PID</FONT></P>
<P><FONT face=3DArial size=3D2>&nbsp;9716 Backup=20
Active&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&=
nbsp;&nbsp;&nbsp;&nbsp;=20
inbound&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;=20
ftpif&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;=20
foo.com&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; foo.com=20
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 5376</FONT> <BR><FONT =
face=3DArial=20
size=3D2>&nbsp;9018 Backup=20
Active&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&=
nbsp;&nbsp;&nbsp;&nbsp;=20
inbound&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;=20
ftpif&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; foo.com=20
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; foo.com=20
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 19580</FONT> <BR><FONT =
face=3DArial=20
size=3D2>&nbsp;9845 Backup=20
Queued&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&=
nbsp;&nbsp;=20
DBArchive&nbsp;&nbsp; =
Oracle-Policy&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;=20
bar.com</FONT> <BR><FONT face=3DArial size=3D2>&nbsp;9844 Backup=20
Queued&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&=
nbsp;&nbsp;=20
DBArchive&nbsp;&nbsp; Oracle-Policy &nbsp;&nbsp;&nbsp;&nbsp; =
bar.com</FONT> </P>
<P><FONT face=3DArial size=3D2>I tried to kill the job with:</FONT> </P>
<P><FONT face=3DArial size=3D2>bpdbjobs -cancel 9716</FONT> </P>
<P><FONT face=3DArial size=3D2>to no avail. The FAQ-O-Matic says I need =
to restart=20
NBU and manually delete the db files? Is this accurate or is there an =
easier way=20
to clean these buggers up?</FONT></P>
<P><FONT face=3DArial size=3D2>Thanks.</FONT> </P>
<P><FONT face=3DArial size=3D2>Aaron Mills</FONT> <BR><FONT face=3DArial =
size=3D2>System=20
Administrator</FONT> <BR><FONT face=3DArial size=3D2>Return Path, =
Inc.</FONT>=20
<BR><FONT face=3DArial size=3D2>303.642.4111</FONT> <BR><FONT =
face=3DArial=20
size=3D2>aaron.mills AT returnpath DOT net</FONT> <BR><A=20
href=3D"http://www.returnpath.biz";><U><FONT face=3DArial color=3D#0000ff =

size=3D2>http://www.returnpath.biz</FONT></U></A> </P><BR></BODY></HTML>

------_=_NextPart_001_01C5FB72.E70FFA7A--

<Prev in Thread] Current Thread [Next in Thread>