Veritas-bu

[Veritas-bu] Bpsched crashing

2006-05-16 09:03:51
Subject: [Veritas-bu] Bpsched crashing
From: Anderson.Mccammont AT morganstanley DOT com (McCammont, Anderson (IT))
Date: Tue, 16 May 2006 14:03:51 +0100

If you have a hang, truss the main bpsched process and see whether it's
blocked in a msgsnd().  Run ipcs -qA and look for CBYTES being close to
QBYTES.  If it is, truss the rest of the bpsched processes and see
whether any of them are attempting a msgrcv().
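
The CBYTES/QBYTES check can be scripted. What follows is only a rough sketch in Python: it assumes the ipcs listing has a header row naming the CBYTES and QBYTES columns and that queue rows start with "q". The function name, the 90% threshold and the sample listing are all made up for illustration and are not real output from any server.

```python
def find_full_queues(ipcs_output, threshold=0.9):
    """Return (queue_id, cbytes, qbytes) for queues at or above threshold.

    Columns are located by header name, not by position, because the
    exact layout of `ipcs -qA` output is an assumption here.
    """
    header = None
    nearly_full = []
    for line in ipcs_output.splitlines():
        fields = line.split()
        if not fields:
            continue
        # The header row is the one that names both columns of interest.
        if "CBYTES" in fields and "QBYTES" in fields:
            header = fields
            continue
        # Only message-queue rows (first field "q") are relevant.
        if header is None or fields[0] != "q" or len(fields) < len(header):
            continue
        row = dict(zip(header, fields))
        try:
            cbytes, qbytes = int(row["CBYTES"]), int(row["QBYTES"])
        except ValueError:
            continue  # row did not line up with the header; skip it
        if qbytes > 0 and cbytes >= threshold * qbytes:
            nearly_full.append((row["ID"], cbytes, qbytes))
    return nearly_full


# Hypothetical listing, invented for this example.
SAMPLE = """\
IPC status from <running system> as of Tue May 16 14:03:51 BST 2006
T ID KEY MODE OWNER GROUP CREATOR CGROUP CBYTES QNUM QBYTES LSPID LRPID STIME RTIME CTIME
Message Queues:
q 0 0x00001234 --rw-rw-rw- root root root root 3980 12 4096 1201 1187 14:02:11 14:02:09 13:45:00
q 1 0x00005678 --rw-rw-rw- root root root root 0 0 4096 0 0 no-entry no-entry 13:45:00
"""

for queue_id, cbytes, qbytes in find_full_queues(SAMPLE):
    print(f"queue {queue_id}: {cbytes} of {qbytes} bytes used")
```

To try it against a live system, pass the text of a real ipcs -qA run to find_full_queues() instead of SAMPLE. A queue that shows up here while the main bpsched sits in msgsnd() is the case described above.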

How you increase the queue depends on the OS release.  For Sol 9 check
http://docs.sun.com/app/docs/doc/806-7009/6jftnqsjp?a=view.
msgsys:msginfo_msgtql is the pertinent one for Sol 9 iirc.
msgsys:msginfo_msgmnb and others used to be relevant on previous
versions.
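
For anyone who has not edited these before, the entries go in /etc/system in roughly the form below. This is a sketch of the syntax only: the <value> placeholders are deliberate, since the right numbers depend on the system and on the Sun document linked above, and a reboot is needed before /etc/system changes take effect.

```
* /etc/system fragment (syntax sketch; <value> is a placeholder, not a recommendation)
set msgsys:msginfo_msgtql=<value>
set msgsys:msginfo_msgmnb=<value>
```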

Apparently the message passing routines have been reworked in NBU6 - I
haven't seen this myself, but it would be welcome as they're sorely in
need of it.



________________________________

        From: veritas-bu-admin AT mailman.eng.auburn DOT edu
[mailto:veritas-bu-admin AT mailman.eng.auburn DOT edu] On Behalf Of Hindle,
Greg
        Sent: 15 May 2006 15:06
        To: Len Boyle; veritas-bu AT mailman.eng.auburn DOT edu
        Subject: RE: [Veritas-bu] Bpsched crashing

        We had the shared memory setting set to use all available memory
and were told by Symantec to lower that figure to 6 gig and leave 2 gig
for Solaris 9 (we have 8 gig of RAM). We have not adjusted any msg queues.
Where would I look and what should they be?


        Greg


________________________________

        From: Len Boyle [mailto:Len.Boyle AT sas DOT com]
        Sent: Monday, May 15, 2006 9:48 AM
        To: Hindle, Greg; veritas-bu AT mailman.eng.auburn DOT edu
        Subject: RE: [Veritas-bu] Bpsched crashing

        Good Morning Greg,

        Have you changed any settings in the /etc/system file to increase
things such as shared memory and msg queues?

        len

________________________________

        From: veritas-bu-admin AT mailman.eng.auburn DOT edu
[mailto:veritas-bu-admin AT mailman.eng.auburn DOT edu] On Behalf Of Hindle,
Greg
        Sent: Monday, May 15, 2006 9:04 AM
        To: veritas-bu AT mailman.eng.auburn DOT edu
        Subject: [Veritas-bu] Bpsched crashing

        NB 5.0 MP6, Solaris 9

        We are having an ongoing issue with bpsched crashing/stopping.
When this happens all the jobs end with status 150. Sometimes it recovers
and restarts the jobs, and other times we have to stop and start the
services. Has anyone else had this issue, and what did you do to fix it?
We have an open ticket with Symantec and they have recommended some
tuning changes, which we have made, but it still went down over the
weekend. Symantec thinks it is a resource issue. We have 8 gig of RAM in
the master server, and it seems to crash most often about 15 minutes into
the main backup window. We would have around 200 jobs running, with 800
queued. This has not been an issue in the past. Any ideas?


        Greg


