Veritas-bu

[Veritas-bu] Periodic jobs hangs

2003-07-11 15:16:59
Subject: [Veritas-bu] Periodic jobs hangs
From: Dmitri.Smirnov AT fusepoint DOT com (Dmitri Smirnov)
Date: Fri, 11 Jul 2003 12:16:59 -0700
This is a multi-part message in MIME format.

------_=_NextPart_001_01C347E1.0016F8A8
Content-Type: text/plain;
        charset="us-ascii"
Content-Transfer-Encoding: quoted-printable

=20
I've started to have jobs hang periodically. They never finish and
Netbackup doesn't start other jobs waiting for them to finish (as soon
as all drives allocated for failed jobs).
I don't have any error messages in log files, I've tried to use
Netbackup with NOSHM option and extended IPC resource as much as I could
imagine.
=20
And now I have a feeling that I know why it is happened. I have a number
of servers registered with more then one class. Due to a small backup
window
Netbackup starts a number of jobs for one server - for example File and
SQL backups at the same time. Everything happened if both jobs has the
same retention period (so they will use the same tape)
One job mounts media and start backup then other job starts, detects
media mounted and waiting for client to send something (and position
tape periodically)...=20
At the end of this backup - first job unmounts media and second job
hangs since it thinks media is mounted...
=20
What do you think group - is it possible? How could I prevent it?
=20
Dmitri

------_=_NextPart_001_01C347E1.0016F8A8
Content-Type: text/html;
        charset="us-ascii"
Content-Transfer-Encoding: quoted-printable

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=3DContent-Type content=3D"text/html; =
charset=3Dus-ascii">
<META content=3D"MSHTML 6.00.2800.1170" name=3DGENERATOR></HEAD>
<BODY>
<DIV><FONT face=3DArial size=3D2><SPAN=20
class=3D209305418-11072003></SPAN></FONT>&nbsp;</DIV>
<DIV><FONT face=3DArial size=3D2><SPAN class=3D209305418-11072003>I've =
started to have=20
jobs hang periodically. They never finish and Netbackup doesn't start =
other jobs=20
waiting for them to finish (as soon as all drives allocated for failed=20
jobs).</SPAN></FONT></DIV>
<DIV><FONT face=3DArial size=3D2><SPAN class=3D209305418-11072003>I =
don't have any=20
error messages in log files, I've tried to use Netbackup with NOSHM =
option and=20
extended IPC resource as much as I could imagine.</SPAN></FONT></DIV>
<DIV><FONT face=3DArial size=3D2><SPAN=20
class=3D209305418-11072003></SPAN></FONT>&nbsp;</DIV>
<DIV><FONT face=3DArial size=3D2><SPAN class=3D209305418-11072003>And =
now I have a=20
feeling that I know why it is happened. I have a number of servers =
registered=20
with more then one class. Due to a small backup =
window</SPAN></FONT></DIV>
<DIV><FONT face=3DArial size=3D2><SPAN =
class=3D209305418-11072003>Netbackup starts a=20
number of jobs for one server - for example File and SQL backups at the =
same=20
time. Everything happened if both jobs has the same retention period (so =
they=20
will use the same tape)</SPAN></FONT></DIV>
<DIV><FONT face=3DArial size=3D2><SPAN class=3D209305418-11072003>One =
job mounts media=20
and start backup then other job starts, detects media mounted=20
</SPAN></FONT><FONT face=3DArial size=3D2><SPAN =
class=3D209305418-11072003>and waiting=20
for </SPAN></FONT><FONT face=3DArial size=3D2><SPAN =
class=3D209305418-11072003>client=20
to send something (and position tape periodically)... =
</SPAN></FONT></DIV>
<DIV><FONT face=3DArial size=3D2><SPAN class=3D209305418-11072003>At the =
end of this=20
backup - first job unmounts media and second job hangs since it thinks =
media is=20
mounted...</SPAN></FONT></DIV>
<DIV><FONT face=3DArial size=3D2><SPAN=20
class=3D209305418-11072003></SPAN></FONT>&nbsp;</DIV>
<DIV><FONT face=3DArial size=3D2><SPAN class=3D209305418-11072003>What =
do you think=20
group - is it possible? How could I prevent it?</SPAN></FONT></DIV>
<DIV><FONT face=3DArial size=3D2><SPAN=20
class=3D209305418-11072003></SPAN></FONT>&nbsp;</DIV>
<DIV><FONT face=3DArial size=3D2><SPAN=20
class=3D209305418-11072003>Dmitri</SPAN></FONT></DIV></BODY></HTML>

------_=_NextPart_001_01C347E1.0016F8A8--

<Prev in Thread] Current Thread [Next in Thread>
  • [Veritas-bu] Periodic jobs hangs, Dmitri Smirnov <=