ADSM-L

Re: TSM-Server crash : Help

2002-11-21 04:06:45
Subject: Re: TSM-Server crash : Help
From: "Frost, Dave" <Dave.Frost AT SUNGARD DOT COM>
To: ADSM-L AT VM.MARIST DOT EDU
Date: Thu, 21 Nov 2002 09:04:52 +0000
Christoph,

what were the functions in the trace-back?  Anything like this:

08/01/2002 16:58:43  ANR9999D Trace-back of called functions:
08/01/2002 16:58:43  ANR9999D   0x0000000100077208  pkFree
08/01/2002 16:58:43  ANR9999D   0x00000001006B0138  SmDoEventLog
08/01/2002 16:58:43  ANR9999D   0x00000001006AC9C0  SmNodeSession
08/01/2002 16:58:43  ANR9999D   0x00000001006999C8  HandleNodeSession
08/01/2002 16:58:43  ANR9999D   0x0000000100699C0C  DoNodeGeneral
08/01/2002 16:58:43  ANR9999D   0x0000000100697008  smExecuteSession
08/01/2002 16:58:43  ANR9999D   0x000000010008A1D8  SessionThread
08/01/2002 16:58:43  ANR9999D   0x000000010007B728  StartThread
08/01/2002 16:58:43  ANR9999D   0xFFFFFFFF7EC1F8A0  *UNKNOWN*
08/01/2002 16:58:43  ANR9999D   0x000000010007B620  StartThread

In which case, we have seen this after a client upgrade - in certain
circumstances the upgraded client will send invalid event information as it
starts, which will crash _any_ server except the latest v5.1.x.

Regards,

-=Dave=-
+44 (0) 20 7608 7140

A bad random number generator: 1, 1, 1, 1, 1, 4.33e+67, 1, 1, 1


|---------+----------------------------------------------->
|         |           Christoph Pilgram                   |
|         |           <[email protected]|
|         |           ELHEIM.COM>                         |
|         |           Sent by: "ADSM: Dist Stor Manager"  |
|         |           <ADSM-L AT VM.MARIST DOT EDU>              |
|         |                                               |
|         |                                               |
|         |           11/21/2002 08:34 AM                 |
|         |           Please respond to "ADSM: Dist Stor  |
|         |           Manager"                            |
|         |                                               |
|---------+----------------------------------------------->
  
>-----------------------------------------------------------------------------------------------|
  |                                                                             
                  |
  |       To:       ADSM-L AT VM.MARIST DOT EDU                                 
                         |
  |       cc:                                                                   
                  |
  |       Subject:  TSM-Server crash : Help                                     
                  |
  
>-----------------------------------------------------------------------------------------------|




Hi all,

since 3 days my TSM-Server (AIX 4.3.3 , TSM 4.1.4) crashs at night.

In dsmserv.err the following messages are written :

11/21/2002 01:31:06  ANR7834S Thread 72 (tid 482c) terminating on signal 11
(Seg
mentation violation).
11/21/2002 01:31:06  ANR7834S GPR  0: 0xffffffff,   1: 0x36a586b0,   2:
0x30199d48,   3: 0x35ba5ed0
11/21/2002 01:31:06  ANR7834S GPR  4: 0x00000000,   5: 0x00000000,   6:
0x00000001,   7: 0x1000b7db
11/21/2002 01:31:06  ANR7834S GPR  8: 0x0000b7db,   9: 0x00000000,  10:
0xf0218fd4,  11: 0x3634d384
11/21/2002 01:31:06  ANR7834S GPR 12: 0x10242198,  13: 0x00000000,  14:
0x00000001,  15: 0x00000001
11/21/2002 01:31:06  ANR7834S GPR 16: 0x00000000,  17: 0x00000000,  18:
0x00000000,  19: 0x00000000
11/21/2002 01:31:06  ANR7834S GPR 20: 0x00000000,  21: 0x00000000,  22:
0x00040000,  23: 0x300c3a38
11/21/2002 01:31:06  ANR7834S GPR 24: 0x00000000,  25: 0x00000001,  26:
0x00000000,  27: 0x35ba5ed0
11/21/2002 01:31:06  ANR7834S GPR 28: 0x35cf5ef0,  29: 0x300027a8,  30:
0x35cf5ef0,  31: 0x35cf5dc0
11/21/2002 01:31:06  ANR7834S IAR: 0x102421b0   LR: 0x10242198   CONTEXT:
0x36a58330
11/21/2002 01:31:06  ANR7833S Server thread 1 terminated in response to
program abort.
11/21/2002 01:31:06  ANR7833S Server thread 2 terminated in response to
program abort.

In actlog the last entries are errors-messages :
11/21/02 01:30:14 ANR0406I Session 1545 started for node TENTAX (OpenVMS)
(Tcp/Ip 148.192.120.14(1025)).
11/21/02 01:30:20 ANR0444W Protocol error on session 1101 for node TENTAX
(OpenVMS) - out-of-sequence verb (type (Unknown)) received.
11/21/02 01:30:21 ANR0484W Session 1101 for node TENTAX (OpenVMS)
terminated
- protocol violation detected.
11/21/02 01:30:30 ANR0480W Session 1102 for node TENTAX (OpenVMS)
terminated
- connection with client severed.
11/21/02 01:30:47 ANR0480W Session 1157 for node TENTAX (OpenVMS)
terminated
- connection with client severed.
11/21/02 01:31:05 ANR0444W Protocol error on session 1545 for node TENTAX
(OpenVMS) - out-of-sequence verb (type ArchQryResp1) received.

Can anybody give me some help.

Thanks
Chris

<Prev in Thread] Current Thread [Next in Thread>