Veritas-bu

[Veritas-bu] TLD Definition

2002-11-12 08:38:10
Subject: [Veritas-bu] TLD Definition
From: jerome.bauwens AT steria DOT com (jerome bauwens)
Date: Tue, 12 Nov 2002 14:38:10 +0100

Hi,

I'm running NetBackup 3.4 on two Solaris servers with an L700 library and
9840 Fibre Channel drives.

The drive configuration on one media server was reset this weekend, for the
second time in a week, and it pisses me off.

[athena]:/usr/openv/netbackup/db/media>tpconfig -d
Index DriveName   DrivePath       Type    Multihost  Status
***** *********   **********      ****    *********  ******
  0   STK98402    /dev/rmt/0cbn   hcart   No         UP
        TLD(0) Definition DRIVE=1   <-------- should be drive 3
  1   STK98403    /dev/rmt/1cbn   hcart   No         UP
        TLD(0) Definition DRIVE=4

Currently defined robotics are:
  TLD(0) robot host = zeus, volume database host = zeus



The TLD definition for the STK98402 drive was DRIVE=3 last week, and it
changed over the weekend. That took the drive down and caused multiple jobs
to fail: every request that should have gone to this drive was sent instead
to the STK98400 (DRIVE=1) on the other media server, and failed because that
drive was already in use (I doubt it would have worked even if the drive had
been free). My first question is: has anybody seen this? My second is: how
do you reset it from the command line (tpconfig doesn't do it)?
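A change like this could be caught before jobs start failing by comparing the `tpconfig -d` output against a known-good mapping. The following is only a minimal sketch, assuming the output keeps the layout shown above. The `EXPECTED` table and the function names `parse_tpconfig` and `find_drift` are hypothetical and not part of NetBackup.

```python
import re

# Hypothetical known-good mapping: drive name -> robot drive number.
EXPECTED = {"STK98402": 3, "STK98403": 4}

# A drive row: index, name, device path, type, multihost flag, status.
DRIVE_LINE = re.compile(
    r"^\s*(\d+)\s+(\S+)\s+(/dev/\S+)\s+(\S+)\s+(\S+)\s+(\S+)\s*$"
)
# The robot definition line that follows each drive row.
DEFINITION_LINE = re.compile(r"TLD\((\d+)\)\s+Definition\s+DRIVE=(\d+)")


def parse_tpconfig(text):
    """Return {drive_name: robot_drive_number} parsed from `tpconfig -d` text."""
    mapping = {}
    current = None
    for line in text.splitlines():
        row = DRIVE_LINE.match(line)
        if row:
            current = row.group(2)
            continue
        definition = DEFINITION_LINE.search(line)
        if definition and current is not None:
            mapping[current] = int(definition.group(2))
            current = None
    return mapping


def find_drift(actual, expected):
    """Return {drive_name: (expected, actual)} for every mismatch."""
    return {
        name: (want, actual.get(name))
        for name, want in expected.items()
        if actual.get(name) != want
    }


SAMPLE = """\
Index DriveName DrivePath Type Multihost Status
***** ********* ********** **** ********* ******
0 STK98402 /dev/rmt/0cbn hcart No UP
TLD(0) Definition DRIVE=1
1 STK98403 /dev/rmt/1cbn hcart No UP
TLD(0) Definition DRIVE=4
Currently defined robotics are:
TLD(0) robot host = zeus, volume database host = zeus
"""

if __name__ == "__main__":
    print(find_drift(parse_tpconfig(SAMPLE), EXPECTED))
```

Run from cron against saved `tpconfig -d` output, a non-empty result would flag the drive whose robot drive number no longer matches.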

In addition, a signal 15 (SIGTERM) seems to have caused the daemons to
restart on both the master and the media server:

Messages file on the master (which is also a media server for 4 drives):

Nov 9 14:27:15 zeus ltid[7769]: [ID 429237 daemon.notice] LTID - received ROBOT MESSAGE, Type=54, LongParam=0, Param1=1, Param2=0
Nov 9 15:02:19 zeus vmd[21335]: [ID 631293 daemon.notice] terminating - successful (0)
Nov 9 15:02:34 zeus tldcd[7975]: [ID 459737 daemon.error] Daemon has terminated due to signal (15)
Nov 9 15:02:34 zeus ltid[7769]: [ID 394161 daemon.error] LTID terminating because it received a signal (15)
Nov 9 15:02:36 zeus ltid[7769]: [ID 265732 daemon.warning] Sending shutdown to tldcd daemon...
Nov 9 15:04:09 zeus vmd[16381]: [ID 734361 daemon.notice] ready for connections on socket 2
Nov 9 15:04:13 zeus tldd[16546]: [ID 754584 daemon.notice] Device=0, TLD=0, DRIVE=1
Nov 9 15:04:13 zeus tldd[16546]: [ID 820121 daemon.notice] Device=1, TLD=0, DRIVE=2
Nov 9 15:04:13 zeus tldd[16546]: [ID 951196 daemon.notice] Device=3, TLD=0, DRIVE=5
Nov 9 15:04:13 zeus tldd[16546]: [ID 116752 daemon.notice] Device=4, TLD=0, DRIVE=6
Nov 9 15:04:13 zeus tldcd[16569]: [ID 617824 daemon.notice] Ready for connections

Messages file on the media server that lost its configuration:

Nov 9 14:58:05 athena vmd[11698]: [ID 631293 daemon.notice] terminating - successful (0)
Nov 9 14:59:10 athena vmd[2226]: [ID 734361 daemon.notice] ready for connections on socket 2
Nov 9 14:59:14 athena tldd[2405]: [ID 754584 daemon.notice] Device=0, TLD=0, DRIVE=1   <-- should be 3
Nov 9 14:59:14 athena tldd[2405]: [ID 820123 daemon.notice] Device=1, TLD=0, DRIVE=4
Nov 9 14:59:29 athena tldd[2409]: [ID 801976 daemon.error] TLD(0) [2409] unable to connect to tldcd on zeus: Connection refused (146)
Nov 9 14:59:29 athena tldd[2405]: [ID 560769 daemon.notice] DecodeQuery() Actual status: Control daemon connect or protocol error
Nov 9 14:59:29 athena tldd[2405]: [ID 181918 daemon.error] TLD(0) unavailable: initialization failed: Control daemon connect or protocol error
Nov 9 15:01:31 athena tldd[2405]: [ID 885524 daemon.notice] DecodeQuery() Actual status: STATUS_SUCCESS
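
The `Device=..., TLD=..., DRIVE=...` notices that tldd logs at startup give the mapping each host actually loaded, so they can be pulled out of the messages file to see when a drive number changed. This is a rough sketch that assumes the syslog line format seen in the excerpts above. The regular expression and the name `drive_map_by_host` are my own, not anything shipped with NetBackup.

```python
import re

# Matches the tldd startup notices, e.g.
# "Nov 9 14:59:14 athena tldd[2405]: [ID ...] Device=0, TLD=0, DRIVE=1"
TLDD_LINE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+[\d:]{8})\s+(?P<host>\S+)\s+tldd\[\d+\]:.*?"
    r"Device=(?P<device>\d+),\s*TLD=(?P<robot>\d+),\s*DRIVE=(?P<drive>\d+)"
)


def drive_map_by_host(lines):
    """Return {host: {device_index: robot_drive_number}} from syslog lines."""
    result = {}
    for line in lines:
        match = TLDD_LINE.match(line)
        if match:
            per_host = result.setdefault(match.group("host"), {})
            per_host[int(match.group("device"))] = int(match.group("drive"))
    return result


SAMPLE_LOG = [
    "Nov 9 15:04:13 zeus tldd[16546]: [ID 754584 daemon.notice] Device=0, TLD=0, DRIVE=1",
    "Nov 9 15:04:13 zeus tldd[16546]: [ID 820121 daemon.notice] Device=1, TLD=0, DRIVE=2",
    "Nov 9 14:59:10 athena vmd[2226]: [ID 734361 daemon.notice] ready for connections on socket 2",
    "Nov 9 14:59:14 athena tldd[2405]: [ID 754584 daemon.notice] Device=0, TLD=0, DRIVE=1",
    "Nov 9 14:59:14 athena tldd[2405]: [ID 820123 daemon.notice] Device=1, TLD=0, DRIVE=4",
]

if __name__ == "__main__":
    for host, mapping in drive_map_by_host(SAMPLE_LOG).items():
        print(host, mapping)
```

If the same device index is logged more than once for a host, the last entry wins, which corresponds to the most recent daemon restart in a chronologically ordered file.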



The timestamps differ between the two logs because the clocks on the two
servers are not in sync.

Any help will be greatly appreciated,

Jerome.

