ADSM-L

Fw: HELP!!!!

2005-10-04 08:13:33
Subject: Fw: HELP!!!!
From: Joni Moyer <joni.moyer AT HIGHMARK DOT COM>
To: ADSM-L AT VM.MARIST DOT EDU
Date: Tue, 4 Oct 2005 08:13:03 -0400
I found some other errors as well, do any of these look familiar?  I've
place a call with IBM, but I'm really at a loss here.  For our NDMP backups
it stated an issue with malloc, but I don't know where this problem is
beginning...

Could I have an issue with the size of the bufferpool on this server?  Here
are the options for this server and it has 4 processors and 8GB of memory.

Server Option      Option Setting        Server Option      Option Setting

-----------------  --------------------  -----------------
--------------------
CommTimeOut        7,200                 IdleTimeOut        360

BufPoolSize        419432                LogPoolSize        1024

MessageFormat      1                     Language           en_US

Alias Halt         HALT                  MaxSessions        500

ExpInterval        0                     ExpQuiet           No

EventServer        Yes                   MirrorRead DB      Normal

MirrorRead LOG     Normal                MirrorWrite DB     Sequential

MirrorWrite LOG    Parallel              VolumeHistory
/t01/volhist/hist01
VolumeHistory      /usr/tivoli/tsm/ser-  Devconfig
/t01/devconfig/dev01
                    ver/bin/volhist

Devconfig          /usr/tivoli/tsm/ser-  TxnGroupMax        256

                    ver/bin/devconfig

MoveBatchSize      1000                  MoveSizeThresh     2048

StatusMsgCnt       10                    RestoreInterval    1,440

UseLargeBuffers    Yes                   DisableScheds      No

NOBUFPREfetch      No                    AuditStorage       Yes

REQSYSauthoutfile  Yes                   SELFTUNEBUFpools-  Yes

                                          ize

SELFTUNETXNsize    Yes                   DBPAGEShadow       No

DBPAGESHADOWFile   dbpgshdw.bdt          MsgStackTrace      On

QueryAuth          None                  LogWarnFullPerCe-  75

                                          nt

ThroughPutDataTh-  0                     ThroughPutTimeTh-  0

 reshold                                  reshold

NOPREEMPT          ( No )                Resource Timeout   60

TEC UTF8 Events    No                    NORETRIEVEDATE     No

DNSLOOKUP          Yes

TCPPort            1500                  TcpAdminport       1500

HTTPPort           1580                  TCPWindowsize      65536

TCPBufsize         16384                 TCPNoDelay         Yes

CommMethod         TCPIP                 CommMethod         ShMem

CommMethod         HTTP                  MsgInterval        1

ShmPort            1510                  FileExit
/t01/log/eventserve-
                                                             r(APPEND)

UserExit                                 FileTextExit

AssistVCRRecovery  Yes                   AcsAccessId        chrs144

AcsTimeoutX        1                     AcsLockDrive       No

AcsQuickInit       No                    SNMPSubagentPort   1521

SNMPSubagentHost   127.0.0.1             SNMPHeartBeatInt   5

TECHost                                  TECPort            0

UNIQUETECevents    No                    UNIQUETDPTECeven-  No

                                          ts

Async I/O          No                    Direct I/O         Yes

SHAREDLIBIDLE      No                    3494Shared         No




         DATE_TIME           MSGNO     MESSAGE
------------------     -----------     ------------------
        2005-10-03            9999     ANR9999D
   22:02:43.000000                      imgroup.c(1180):
                                        ThreadId<511>
                                        Error 8
                                        retrieving Backup
                                        Objects row for
                                        object
                                        0.295703482
                                        Callchain of
                                        previous message:
                                        0x0000000100017d-
                                        94 outDiagf <-
                                        0x00000001003dea-
                                        d4 imIsGroupLead-
                                        er <- 0x00000001-
                                        00385564
                                        SmNodeSession <-
                                        0x000000010043bb-
                                        38 HandleNodeSes-
                                        sion <-
                                        0x00000001004419-
                                        64 smExecuteSess-
                                        ion <-
                                        0x00000001004344-
                                        78 SessionThread
                                        <- 0x00000001000-
                                        08078 StartThread
                                        <- 0x09000000002-
                                        f4460 _pthread_b-
                                        ody <-  (SESSION:
                                        40004)
        2005-10-03            9999     ANR9999D
   22:02:43.000000                      smnode.c(7056):
                                        ThreadId<511>
                                        Session 40004:
                                        Invalid Group Id
                                        0,295703482 for
                                        ADD function
                                        Callchain of
                                        previous message:
                                        0x0000000100017d-
                                        94 outDiagf <-
                                        0x00000001003855-
                                        8c SmNodeSession
                                        <- 0x00000001004-
                                        3bb38 HandleNode-
                                        Session <-
                                        0x00000001004419-
                                        64 smExecuteSess-
                                        ion <-
                                        0x00000001004344-
                                        78 SessionThread
                                        <- 0x00000001000-
                                        08078 StartThread
                                        <- 0x09000000002-
                                        f4460 _pthread_b-
                                        ody <-  (SESSION:
        2005-10-03            8311     ANR8311E An I/O
   22:37:12.000000                      error occurred
                                        while accessing
                                        drive SL8500
                                        (/dev/rmt7) for
                                        LOCATE operation,
                                        errno = 78.
                                        (SESSION: 29020,
                                        PROCESS: 680)
        2005-10-03            1165     ANR1165E Error
   22:37:13.000000                      detected for file
                                        in storage pool
                                        TAPE_ORACLE: Node
                                        FJSU102, Type
                                        Backup, File
                                        space /p01, fsId
                                        18, File name
                                        /app/cyb/esp/MED-
                                        -ESPSystemAgent/-
                                        spool/CM_DEMO/MA-
                                        IN/MEDAXBP.1366/
                                        VMSDBSAP.
                                        (SESSION: 29020,
                                        PROCESS: 680)
        2005-10-03            3523     ANR3523W GENERATE
   22:37:13.000000                      BACKUPSET:
                                        Retrieve failed
                                        - error on input
                                        storage device.
                                        (SESSION: 29020,
                                        PROCESS: 680)
        2005-10-03            3503     ANR3503E
   22:37:13.000000                      Generation of
                                        backup set for
                                        FJSU102 as
                                        FJSU102_BACKUPSE-
                                        T.295467854
                                        failed. (SESSION:
                                        29020, PROCESS:
                                        680)
        2005-10-03            2032     ANR2032E GENERATE
   22:37:14.000000                      BACKUPSET:
                                        Command failed -
                                        internal server
                                        error detected.
                                        (SESSION: 29020,
                                        PROCESS: 680)
        2005-10-03            9999     ANR9999D
   22:50:53.000000                      imgroup.c(1180):
                                        ThreadId<437>
                                        Error 8
                                        retrieving Backup
                                        Objects row for
                                        object
                                        0.295537065
                                        Callchain of
                                        previous message:
                                        0x0000000100017d-
                                        94 outDiagf <-
                                        0x00000001003dea-
                                        d4 imIsGroupLead-
                                        er <- 0x00000001-
                                        00385564
                                        SmNodeSession <-
                                        0x000000010043bb-
                                        38 HandleNodeSes-
                                        sion <-
                                        0x00000001004419-
                                        64 smExecuteSess-
                                        ion <-
                                        0x00000001004344-
                                        78 SessionThread
                                        <- 0x00000001000-
                                        08078 StartThread
                                        <- 0x09000000002-
                                        f4460 _pthread_b-
                                        ody <-  (SESSION:
                                        39214)
        2005-10-03            9999     ANR9999D
   22:50:53.000000                      smnode.c(7056):
                                        ThreadId<437>
                                        Session 39214:
                                        Invalid Group Id
                                        0,295537065 for
                                        ADD function
                                        Callchain of
                                        previous message:
                                        0x0000000100017d-
                                        94 outDiagf <-
                                        0x00000001003855-
                                        8c SmNodeSession
                                        <- 0x00000001004-
                                        3bb38 HandleNode-
                                        Session <-
                                        0x00000001004419-
                                        64 smExecuteSess-
                                        ion <-
                                        0x00000001004344-
                                        78 SessionThread
                                        <- 0x00000001000-
                                        08078 StartThread
                                        <- 0x09000000002-
                                        f4460 _pthread_b-
                                        ody <-  (SESSION:
                                        39214)
        2005-10-03             423     ANR0423W Session
   22:51:42.000000                      41306 for
                                        administrator  (
                                        ) refused -
                                        administrator
                                        name not
                                        registered.
                                        (SESSION: 41306)
        2005-10-03            8311     ANR8311E An I/O
   22:52:14.000000                      error occurred
                                        while accessing
                                        drive SL8500
                                        (/dev/rmt7) for
                                        OFFL operation,
                                        errno = 78.
                                        (SESSION: 29020,
                                        PROCESS: 680)
        2005-10-03            8769     ANR8769E External
   23:34:45.000000                      media management
                                        function DISMOUNT
                                        returned
                                        result=LIBRARY_E-
                                        RROR. (SESSION:
                                        29020, PROCESS:
                                        680)
        2005-10-03            8469     ANR8469E Dismount
   23:34:45.000000                      of LTO volume
                                        T00897 from drive
                                        SL8500
                                        (/dev/rmt7) in
                                        library SL8500
                                        failed. (SESSION:
                                        29020, PROCESS:
                                        680)
        2005-10-03            1410     ANR1410W Access
   23:34:46.000000                      mode for volume
                                        T00897 now set to
                                        "unavailable".
                                        (SESSION: 29020,
                                        PROCESS: 680)
        2005-10-03            9999     ANR9999D
   23:46:25.000000                      ssremote.c(503):
                                        ThreadId<136>
                                        Unable to open
                                        remote session of
                                        type 1. Callchain
                                        of previous
                                        message:
                                        0x0000000100017d-
                                        94 outDiagf <-
                                        0x00000001004a41-
                                        78 ssInitStoreRe-
                                        mote <-
                                        0x000000010066ad-
                                        10 AfInitStoreRe-
                                        mote <-
                                        0x00000001006670-
                                        24 bfInitStoreRe-
                                        mote <-
                                        0x00000001006a05-
                                        c4 DoBackup <-
                                        0x00000001006a3e-
                                        5c AdmBackupNode
                                        <- 0x00000001001-
                                        63168 AdmCommand-
                                        Local <-
                                        0x00000001001642-
                                        ac admCommand <-
                                        0x000000010015b1-
                                        80 RunScript <-
                                        0x000000010015cd-
                                        30 DoRunScript <-
                                        0x00000001001631-
                                        68 AdmCommandLoc-
                                        al <- 0x00000001-
                                        001642ac
                                        admCommand <-
                                        0x000000010064ec-
                                        58 SmExecSchedul-
                                        edCommand <-
                                        0x000000010064ee-
                                        54 smScheduledCo-
                                        nsoleSession <-
                                        0x000000010064c8-
                                        60 CsRunCmdThread
                                        <- 0x00000001000-
                                        08078 StartThread
                                        <- 0x09000000002-
                                        f4460 _pthread_b-
                                        ody <-  (SESSION:
                                        38192, PROCESS:
                                        963)
        2005-10-03            2032     ANR2032E BACKUP
   23:46:25.000000                      NODE: Command
                                        failed - internal
                                        server error
                                        detected.
                                        (SESSION: 38192,
                                        PROCESS: 963)
        2005-10-03            1463     ANR1463E RUN:
   23:46:25.000000                      Command script
                                        NAS_2-DIFFERENTI-
                                        AL completed in
                                        error. (SESSION:
                                        38192, PROCESS:
                                        963)
        2005-10-03            2752     ANR2752E Scheduled
   23:46:25.000000                      command
                                        NAS_2-DIFFERENTI-
                                        AL failed.
                                        (SESSION: 38192,
                                        PROCESS: 963)

********************************
Joni Moyer
Highmark
Storage Systems
Work:(717)302-6603
Fax:(717)302-5974
joni.moyer AT highmark DOT com
********************************


                                                                       
             "Richard Sims"                                            
             <rbs AT bu DOT edu>                                              
                                                                        To
             10/04/2005 07:56          "Joni Moyer"                    
             AM                        <joni.moyer AT highmark DOT com>       
                                                                        cc
                                                                       
                                                                   Subject
                                       Re: HELP!!!!                    
                                                                       
                                                                       
                                                                       
                                                                       
                                                                       
                                                                       




Joni - That's an error we haven't seen before.

Your best course of action is to call TSM Support.

   Richard Sims

On Oct 4, 2005, at 7:32 AM, Joni Moyer wrote:

> Has anyone ever seen this message before?  I have TSM 5.2.4 running
> on AIX
> 5.2 and it seems like this error message occurred and then all
> processing
> stopped and it almost looks like TSM stopped & restarted itself.  Any
> suggestions are appreciated!!!!  I'm completely lost in this
> situation.
> Thank you in advance!
>
> 10/03/05 23:46:25     ANR9999D ssremote.c(503): ThreadId<136>
> Unable to
> open
>                        remote session of type 1. Callchain of previous
> message:
>                        0x0000000100017d94 outDiagf <-
> 0x00000001004a4178
>
>                        ssInitStoreRemote <- 0x000000010066ad10
> AfInitStoreRemote
>                        <- 0x0000000100667024 bfInitStoreRemote <-
> 0x00000001006-
>                        a05c4 DoBackup <- 0x00000001006a3e5c
> AdmBackupNode
> <-
>                        0x0000000100163168 AdmCommandLocal <-
> 0x00000001001642ac
>                        admCommand <- 0x000000010015b180 RunScript <-
> 0x00000001-
>                        0015cd30 DoRunScript <- 0x0000000100163168
> AdmCommandLoc-
>                        al <- 0x00000001001642ac admCommand <-
> 0x000000010064ec58
>                        SmExecScheduledCommand <- 0x000000010064ee54
> smScheduled-
>                        ConsoleSession <- 0x000000010064c860
> CsRunCmdThread
> <-
>                        0x0000000100008078 StartThread <-
> 0x09000000002f4460
>
>                        _pthread_body <-  (SESSION: 38192, PROCESS:
> 963)
> 10/03/05 23:46:25     ANR2032E BACKUP NODE: Command failed - internal
> server
>                        error detected. (SESSION: 38192, PROCESS: 963)
>
> 10/03/05 23:46:25     ANR2753I (NAS_2-DIFFERENTIAL):ANR2032E BACKUP
> NODE:
>
>                        Command failed - (SESSION: 38192)
>
> 10/03/05 23:46:25     ANR2753I (NAS_2-DIFFERENTIAL):internal server
> error
>
>                        detected.  (SESSION: 38192)
>
> 10/03/05 23:46:25     ANR1463E RUN: Command script NAS_2-DIFFERENTIAL
> completed
>                        in error. (SESSION: 38192, PROCESS: 963)
>
>
>
> ********************************
> Joni Moyer
> Highmark
> Storage Systems
> Work:(717)302-6603
> Fax:(717)302-5974
> joni.moyer AT highmark DOT com
> ********************************
>



<Prev in Thread] Current Thread [Next in Thread>