ADSM-L

Re: Q STG hangs during reclamation

2006-03-01 12:12:20
Subject: Re: Q STG hangs during reclamation
From: Orville Lantto <orville.lantto AT GLASSHOUSE DOT COM>
To: ADSM-L AT VM.MARIST DOT EDU
Date: Wed, 1 Mar 2006 12:10:29 -0500
This seems to be the cause.  I see the same diagnostics.


IC48429: MIGRATE STGPOOL PREVENTS UPDATE STGPOOL FROM RUNNING AND HANGS 
SUBSEQUENT QUERY VOLUME AND SELECT FROM VOLUMES COMMANDS.

 

APAR status
Closed as program error.

Error description 
A MIGRATE STGPOOL command prevents UPDATE STGPOOL command from
running at the same time. If the UPDATE STGPOOL command is
issued while the MIGRATE STGPOOL is running, it will also
prevents QUERY VOLUME and SELECT FROM VOLUMES commands from
running as well.
A show lock command will show 1 transaction holding lock type
34000 with other transactions waiting for the same lock.
For example :
slot -> 79:
LockDesc: Type=34000, NameSpace=0, SummMode=sLock, Key=''
  Holder: Tsn=0:1.4265401449, Mode=sLock
  Waiter: Tsn=0:1.4265407080, Mode=ixLock
  Waiter: Tsn=0:1.4265407084, Mode=sLock
  Waiter: Tsn=0:1.4265407118, Mode=sLock
  Waiter: Tsn=0:1.4265407433, Mode=sLock
  Waiter: Tsn=0:1.4265407491, Mode=sLock
In the above output, transaction Tsn=0:1.4265401449 is holding
lock type 34000. A show txnt output will reveal for example :
slot -> 105:
Tsn=0:1.4265401449, Resurrected=False, InFlight=True,
  Distributed=False, Addr 82a778f8
  ThreadId=67, Timestamp=01/16/06 15:00:27,
  Creator=admcmd.c(1864) Participants=1, summaryVote=ReadOnly
  EndInFlight False, endThreadId 67, tmidx 0 0,
  processBatchCount 0, mustAbort False.
  Participant DB: voteReceived=False, ackReceived=False
  Locks held by Tsn=0:1.4265401449 :
    Type=34000, NameSpace=0, SummMode=sLock, Mode=sLock, Key=''
A show thread will show that the corresponding thread
( thread 67 in this case) belongs to the MIGRATE STGPOOL
command. For example :
Thread 67: CsRunCmdThread
 tid=17183, ktid=2384073, ptid=62, det=1, zomb=0, join=0,
 result=0, sess=0
  Awaiting cond pCtrl->allDone (0x18a5447e0),
  using mutex pCtrl->mutex
 (0x186dcf6d8), in AdmMigrateStgPool (0x1007b3e74)
  Stack trace:
    0x09000000003848fc _cond_wait_global
    0x090000000038530c _cond_wait
    0x0900000000385d6c pthread_cond_wait
    0x000000010000d91c pkWaitCondition
    0x00000001007b3e7c AdmMigrateStgPool
    0x00000001001d9728 AdmCommandLocal
    0x00000001001da668 admCommand
    0x0000000100547b34 SmExecScheduledCommand
    0x0000000100547d4c smScheduledConsoleSession
    0x0000000100545510 CsRunCmdThread
    0x000000010000e9dc StartThread
    0x090000000036f460 _pthread_body

TSM Versions Affected:
TSM 5.3 Servers on all platforms.

Additional Keywords:
hang hung wait performance lock conflict SS_LOCK_UNIVERSE
AdmMigrateStgPool AdmUpdateStgPool
Local fix 
Do not run the UPDATE STGPOOL command while the MIGRATE STGPOOL
command runs.
Problem summary 
****************************************************************
* USERS AFFECTED: All users of TSM 5.3.x servers that use      *
*                 command migration with WAIT=YES option       *
*                 enabled.                                     *
****************************************************************
* PROBLEM DESCRIPTION: See Error Description.                  *
****************************************************************
* RECOMMENDATION: Apply fixing level when available.           *
*                 This problem is currently projected to be    *
*                 fixed in level 5.3.3. Note that this is      *
*                 subject to change at the discretion of       *
*                 IBM.                                         *
****************************************************************
Due to lock contention, MIGRATE STGPOOL WAIT=YES command
could prevent the other commands from running for
the duration of the migration.
Problem conclusion 
The problem has been fixed.
Temporary fix 
Comments 
APAR information        
APAR number      IC48429        
Reported component name  TSM SERVER     
Reported component ID    5698ISMSV      
Reported release         53A    
Status   CLOSED PER     
PE       NoPE   
HIPER    NoHIPER        
Submitted date   2006-01-24     
Closed date      2006-02-13     
Last modified date       2006-02-13     

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:
PK19694 <http://www-1.ibm.com/support/docview.wss?uid=swg1PK19694> 

Modules/Macros


Fix information 
Fixed component name     TSM SERVER     
Fixed component ID       5698ISMSV      

Applicable component levels     
R53A PSY            UP  
R53H PSY            UP  
R53L PSY            UP  
R53S PSY            UP  
R53W PSY            UP  
        
Rate this page  
 
Orville L. Lantto
Glasshouse Technologies, Inc.
 
 

________________________________

From: ADSM: Dist Stor Manager on behalf of Rainer Wolf
Sent: Wed 3/1/2006 3:01 AM
To: ADSM-L AT VM.MARIST DOT EDU
Subject: Re: [ADSM-L] Q STG hangs during reclamation



Hi,
shortly had the same effect. There are various reasons for this behaviour - 
often hardware-related
sometimes maybe just software.
Our last 'hang of query stg ' results from a filesystem that was offline
because of a defect FC-Adapter  - so
first I would check ( on os-level ) if all
related disk-hardware -filesystems or raw partitions-  are ok and really usable.
In tsm you may also check the client events of the last 24 hours.
You should call ibm  for support.

Greetings
Rainer

Orville Lantto wrote:
>
> We are having problems on a TSM server instance on AIX.  Some simple commands 
> such as "Q Stg" appear to hang, along with their sessions, while reclamation 
> is in process.  Other commands like "Q Pr' do not hang.  The server is AIX 
> 5.2.5 with six instances of TSM Server 5.3.2.0 on it.  The disk is DS4300 
> with a 3584 tape library shared between the instances.
>
> Anyone have a guess as to why?
>
> Orville L. Lantto
> Glasshouse Technologies, Inc.
> Cell:  952-738-1933
>
>
> ________________________________
>
> From: ADSM: Dist Stor Manager on behalf of Prather, Wanda
> Sent: Tue 2/28/2006 4:50 PM
> To: ADSM-L AT VM.MARIST DOT EDU
> Subject: [ADSM-L] 3584 help
>
> Having a problem with the Tape Library Specialist/web app, so I want to
> know what level of microcode is on this 3584.
>
> Can you get that information from the web app, or for that matter, from
> the front LED panel of the 3584?
>
> Thanks!

--
------------------------------------------------------------------------
Rainer Wolf                          eMail:       rainer.wolf AT uni-ulm DOT de
kiz - Abt. Infrastruktur           Tel/Fax:      ++49 731 50-22482/22471
Universitaet Ulm                     wwweb:        http://kiz.uni-ulm.de

<Prev in Thread] Current Thread [Next in Thread>