ADSM-L

Netware Backup Shows as COMPLETE but is not !!

2002-01-18 09:41:13
Subject: Netware Backup Shows as COMPLETE but is not !!
From: Robin Lowe <robin_lowe AT STANDARDLIFE DOT COM>
Date: Fri, 18 Jan 2002 14:38:08 -0000
We have unearthed a wide spread problem due to ongoing ANS1872E errors on
Novell Clients (that have been widely discussed here).
We are aware of the problem, and are taking steps to address the root cause.

However, we have discovered the problem is more widespread than we first
realised due to what we interpret as mis-leading information from the EVENT
records.

We discovered that when TSM goes to backup these affected clients, the event
record shows the backup as COMPLETED and not as FAILED which we expected
thus :


Scheduled Start            Actual Start            Schedule Name       Node
Name       Status
--------------------          --------------------         -------------
         -------------           ---------
         -------------           ---------
01/16/2002 20:00:00  01/16/2002 20:01:36  SCH_BRLAN_DLY SLUKWAF1
Completed
01/17/2002 20:00:00  01/17/2002 21:52:28  SCH_BRLAN_DLY SLUKWAF1
Completed
01/18/2002 20:00:00                                   SCH_BRLAN_DLY SLUKWAF1
Future

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
++++++++++++++++++++
From the dsmsched.log :

17.01.2002 21:53:52 Incremental backup of volume 'SYS:'
17.01.2002 21:53:52 Incremental backup of volume 'DATA:'
17.01.2002 21:53:52 Incremental backup of volume 'APPLICS:'
17.01.2002 21:53:53 ANS1872E Unable to connect to NetWare target service
'SLUKWAF1.NetWare File System'.
Make sure the TSA NLM is loaded on the specified machine.
17.01.2002 21:53:53 ANS1228E Sending of object 'SYS:' failed
17.01.2002 21:53:53 Unknown system error
Please check the TSM Error Log for any additional information

17.01.2002 21:53:53 ANS1872E Unable to connect to NetWare target service
'SLUKWAF1.NetWare File System'.
Make sure the TSA NLM is loaded on the specified machine.
17.01.2002 21:53:54 ANS1872E Unable to connect to NetWare target service
'SLUKWAF1.NetWare File System'.
Make sure the TSA NLM is loaded on the specified machine.
17.01.2002 21:53:54 ANS1228E Sending of object 'DATA:' failed
17.01.2002 21:53:54 Unknown system error
Please check the TSM Error Log for any additional information

17.01.2002 21:53:54 ANS1228E Sending of object 'APPLICS:' failed
17.01.2002 21:53:54 Unknown system error
Please check the TSM Error Log for any additional information

17.01.2002 21:53:57 --- SCHEDULEREC STATUS BEGIN
17.01.2002 21:53:57 --- SCHEDULEREC OBJECT END SCH_BRLAN_DLY 17.01.2002
20:00:00
17.01.2002 21:53:57 Scheduled event 'SCH_BRLAN_DLY' completed successfully.
17.01.2002 21:53:57 Sending results for scheduled event 'SCH_BRLAN_DLY'.
17.01.2002 21:53:57 Results sent to server for scheduled event
'SCH_BRLAN_DLY'.

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
+++++++++++++++++++++++

and the dsmerror.log :

7.01.2002 21:53:53 ANS1872E Unable to connect to NetWare target service
'SLUKWAF1.NetWare File System'.
Make sure the TSA NLM is loaded on the specified machine.
17.01.2002 21:53:53 (SMDR-5.0-21) The maximum number of connections allowed
through the SMDR (64) has been exceeded.
17.01.2002 21:53:53 ANS1228E Sending of object 'SYS:' failed
17.01.2002 21:53:53 Return code 3006 unknown
17.01.2002 21:53:53 Return code 3006 unknown
17.01.2002 21:53:53 ANS1872E Unable to connect to NetWare target service
'SLUKWAF1.NetWare File System'.
Make sure the TSA NLM is loaded on the specified machine.
17.01.2002 21:53:53 (SMDR-5.0-21) The maximum number of connections allowed
through the SMDR (64) has been exceeded.
17.01.2002 21:53:54 ANS1872E Unable to connect to NetWare target service
'SLUKWAF1.NetWare File System'.
Make sure the TSA NLM is loaded on the specified machine.
17.01.2002 21:53:54 (SMDR-5.0-21) The maximum number of connections allowed
through the SMDR (64) has been exceeded.
17.01.2002 21:53:54 ANS1228E Sending of object 'DATA:' failed
17.01.2002 21:53:54 Return code 3006 unknown
17.01.2002 21:53:54 Return code 3006 unknown
17.01.2002 21:53:54 ANS1228E Sending of object 'APPLICS:' failed
17.01.2002 21:53:54 Return code 3006 unknown
17.01.2002 21:53:54 Return code 3006 unknown

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
++++++++++++++++++++++++++


Okay, can someone explain how this can be a SUCCESSFULL backup when no data
is transferred ?


Is this a bug, or TSM working as designed ?

If a bug I will open a PMR, but personally I feel that if we cannot get to
the Novell volumes (objects) SYS: DATA: APPLICS: then this should be deemed
a FAIL?


Has anyone else seen this as a problem?
The fact is that we now have to go to several dozen clients and interrogate
the dsmsched.log/dsmerror.log to find out which nodes are affected.

By the way, does anyone have a suggestion as to how to dynamically collate
the errors in the dsmsched.log to say TIVOLI TEC for example, and is the
overhead acceptable on approx 200 Novell servers ?
Or is there a good product available to do this task ?


Thanks

Robin Lowe
Senior Storage Analyst









For more information on Standard Life, visit our website
http://www.standardlife.com/   The Standard Life Assurance Company, Standard
Life House, 30 Lothian Road, Edinburgh EH1 2DH, is registered in Scotland
(No SZ4) and regulated by the Personal Investment Authority.  Tel: 0131 225
2552 - calls may be recorded or monitored.  This confidential e-mail is for
the addressee only.  If received in error, do not retain/copy/disclose it
without our consent and please return it to us.  We virus scan all e-mails
but are not responsible for any damage caused by a virus or alteration by a
third party after it is sent.
<Prev in Thread] Current Thread [Next in Thread>