ADSM-L

Re: [ADSM-L] Strange behaviour of TSM

2010-10-26 06:39:33
Subject: Re: [ADSM-L] Strange behaviour of TSM
From: Grigori Solonovitch <Grigori.Solonovitch AT AHLIUNITED DOT COM>
To: ADSM-L AT VM.MARIST DOT EDU
Date: Tue, 26 Oct 2010 13:37:49 +0300
My primary pool is FILE and there is no errors at all.

Grigori G. Solonovitch

Senior Technical Architect

Information Technology  Ahli United Bank Kuwait http://www.ahliunited.com.kw

Phone: (+965) 2231-2274  Mobile: (+965) 99798073  E-Mail: Grigori.Solonovitch 
AT ahliunited DOT com

Please consider the environment before printing this Email


-----Original Message-----
From: ADSM: Dist Stor Manager [mailto:ADSM-L AT VM.MARIST DOT EDU] On Behalf Of 
Christian Svensson
Sent: Tuesday, October 26, 2010 1:35 PM
To: ADSM-L AT VM.MARIST DOT EDU
Subject: [ADSM-L] SV: Strange behaviour of TSM

Hi Grigori,
Look for read errors on your tape pools.

/Christian

-----Ursprungligt meddelande-----
Från: Grigori Solonovitch [mailto:Grigori.Solonovitch AT AHLIUNITED DOT COM]
Skickat: den 26 oktober 2010 10:49
Till: ADSM-L AT VM.MARIST DOT EDU
Ämne: Strange behaviour of TSM

TSM Server 5.5.4.1 under AIX 5.3-12-01.



TSM Client 6.2.1.1 under AIX 5.3-12-01



TDP for Oracle 5.5.2.0 - Oracle 10.2.0.4



I am trying to duplicate database from production to test with point in time 
recovery.

It restores level 0 backups (13/10/10) and incremental backups from 14/10/10 
till 18/10/10 successfully, but for incremental backup from 19/10/10 I have got 
error:



channel t1: reading from backup piece /LPAR11/pibs.18.1.1798.1.732727838

channel t1: restored backup piece 1

piece handle=/LPAR11/pibs.18.1.1798.1.732727838 
tag=DATABASE_PIBS.LEVEL_1.DATE_18

channel t1: restore complete, elapsed time: 00:02:06

channel t1: starting incremental datafile backupset restore

channel t1: specifying datafile(s) to restore from backup set

destination for restore of datafile 00001: /home/oracle/DIBS/ibssys1

destination for restore of datafile 00002: /home/oracle/DIBS/ibsrbs1

destination for restore of datafile 00003: /home/oracle/DIBS/ibsaux1

destination for restore of datafile 00004: /home/oracle/DIBS/ibstls1

destination for restore of datafile 00005: /home/oracle/DIBS/ibsusr1

destination for restore of datafile 00007: /home/oracle/DIBS/ibsdphx1

destination for restore of datafile 00008: /home/oracle/DIBS/ibsiphx1

destination for restore of datafile 00009: /home/oracle/DIBS/ibsatm1

destination for restore of datafile 00010: /home/oracle/DIBS/ibsibs1

destination for restore of datafile 00011: /home/oracle/DIBS/ibsgfs1

channel t1: reading from backup piece /LPAR11/pibs.19.1.1802.1.732814241

released channel: t1

RMAN-00571: ===========================================================

RMAN-00569: =============== ERROR MESSAGE STACK FOLLOWS ===============

RMAN-00571: ===========================================================

RMAN-03002: failure of Duplicate Db command at 10/26/2010 09:42:55

RMAN-03015: error occurred in stored script Memory Script

ORA-19870: error reading backup piece /LPAR11/pibs.19.1.1802.1.732814241

ORA-19501: read error on file "/LPAR11/pibs.19.1.1802.1.732814241", blockno 1 
(blocksize=512)

ORA-27190: skgfrd: sbtread2 returned error

ORA-19511: Error received from media manager layer, error text:

   ANS1314E (RC14)   File data currently unavailable on server

RMAN>



It looks like required file is not found in the primary pool. It is very 
strange, because I could see required file on TSM Server:



LPAR11_ORA      /pibs_pibs   /LPAR11/pibs.18.1.1798.1.732727838 2010-10-18 
15:30:39          ACTIVE_VERSION       DBLPAR11 537320262

LPAR11_ORA      /pibs_pibs   /LPAR11/pibs.19.1.1802.1.732814241 2010-10-19 
15:30:43          ACTIVE_VERSION       DBLPAR11 537500861

LPAR11_ORA      /pibs_pibs   /LPAR11/pibs.20.1.1806.1.732900640 2010-10-20 
15:30:42          ACTIVE_VERSION       DBLPAR11 537684112



In addition, I have found that TSM Server is trying to restore data from 
available copy pools (automatically):



26/10/2010 09:42:54 ANR0403I Session 1082 ended for node LPAR11_ORA (TDP Oracle 
AIX). (SESSION: 1082)

26/10/2010 09:42:54 ANE4991I (Session: 1082, Node: LPAR11_ORA) TDP Oracle AIX 
ANU0599 TDP for Oracle: (389236): =>(LPAR11_ORA) ANU2527I Restore details for 
backup piece /pibs_pibs///LPAR11/pibs.19.1.1802.1.732814241. Total bytes 
received: 0. Total processing time: 00:00:01. Throughput rate: 0.00 Kb/Sec. 
(SESSION: 1082)

26/10/2010 09:42:54 ANE4991I (Session: 1082, Node: LPAR11_ORA) TDP Oracle AIX 
ANU0599 TDP for Oracle: (389236): =>(LPAR11_ORA) ANU2536I File 
/pibs_pibs///LPAR11/pibs.19.1.1802.1.732814241 = 0 bytes recieved(SESSION: 1082)

26/10/2010 09:42:54 ANR0565W Retrieve or restore failed for session 1082 for 
node LPAR11_ORA (TDP Oracle AIX). The storage volume CP2B145A is inaccessible. 
(SESSION: 1082)

26/10/2010 09:42:54 ANR1422W Read access denied for volume CP2B145A - volume 
access mode="offsite". (SESSION: 1082)

26/10/2010 09:42:54 ANR1401W Mount request denied for volume /DCP3_DR/DCP30018 
- mount failed. (SESSION: 1082)

26/10/2010 09:42:54 ANR0406I Session 1082 started for node LPAR11_ORA (TDP 
Oracle AIX) (Tcp/Ip LPAR03(61008)). (SESSION: 1082)

26/10/2010 09:42:44 ANR0403I Session 1049 ended for node LPAR11_ORA (TDP Oracle 
AIX). (SESSION: 1049)

26/10/2010 09:42:44 ANR0514I Session 1049 closed volume /DAILY_2/DAY20020. 
(SESSION: 1049)

26/10/2010 09:42:44 ANE4991I (Session: 1049, Node: LPAR11_ORA) TDP Oracle AIX 
ANU0599 TDP for Oracle: (389236): =>(LPAR11_ORA) ANU2527I Restore details for 
backup piece /pibs_pibs///LPAR11/pibs.18.1.1798.1.732727838. Total bytes 
received: 1967652864. Total processing time: 00:01:56. Throughput rate: 
16564.97 Kb/Sec. (SESSION: 1049)

26/10/2010 09:42:44 ANE4991I (Session: 1049, Node: LPAR11_ORA) TDP Oracle AIX 
ANU0599 TDP for Oracle: (389236): =>(LPAR11_ORA) ANU2536I File 
/pibs_pibs///LPAR11/pibs.18.1.1798.1.732727838 = 1967652864 bytes 
recieved(SESSION: 1049)



As you see TSM Server is trying to mount copy volume CP2B145A from out-of 
country copy pool and then trying to mount volume DCP3_DR/DCP30018 from 
disaster site, which was unreachable due to Disaster Site testing.



I have checked and found no problems in primary pools. In addition, we have 
mounted remote Disaster Site copy pool (type FILE)  and repeated duplicate 
operation successfully (with reading data from Disaster Site copy pool !!!!!!).



What does it mean? Database corruption? Primary pool corruption?



There are no error messages at all!!!!



I will deeply appreciate any comments.



Grigori G. Solonovitch

Senior Technical Architect

Information Technology  Ahli United Bank Kuwait http://www.ahliunited.com.kw

Phone: (+965) 2231-2274  Mobile: (+965) 99798073  E-Mail: Grigori.Solonovitch 
AT ahliunited DOT com

Please consider the environment before printing this Email



________________________________
CONFIDENTIALITY AND WAIVER: The information contained in this electronic mail 
message and any attachments hereto may be legally privileged and confidential. 
The information is intended only for the recipient(s) named in this message. If 
you are not the intended recipient you are notified that any use, disclosure, 
copying or distribution is prohibited. If you have received this in error 
please contact the sender and delete this message and any attachments from your 
computer system. We do not guarantee that this message or any attachment to it 
is secure or free from errors, computer viruses or other conditions that may 
damage or interfere with data, hardware or software.

Please consider the environment before printing this Email.

CONFIDENTIALITY AND WAIVER: The information contained in this electronic mail 
message and any attachments hereto may be legally privileged and confidential. 
The information is intended only for the recipient(s) named in this message. If 
you are not the intended recipient you are notified that any use, disclosure, 
copying or distribution is prohibited. If you have received this in error 
please contact the sender and delete this message and any attachments from your 
computer system. We do not guarantee that this message or any attachment to it 
is secure or free from errors, computer viruses or other conditions that may 
damage or interfere with data, hardware or software.

<Prev in Thread] Current Thread [Next in Thread>