ADSM-L

Re: DSMC Exit Code Question

2003-06-27 17:52:08
Subject: Re: DSMC Exit Code Question
From: Andrew Raibeck <storman AT US.IBM DOT COM>
To: ADSM-L AT VM.MARIST DOT EDU
Date: Fri, 27 Jun 2003 14:51:31 -0700
Hi Chuck,

The issue you speak of has always been in ADSM/TSM. The product didn't
examine and back up one file at a time. Rather, files would be grouped
into transactions based on TXNGROUPMAX and TXNBYTELIMIT settings. If,
during the processing of a transaction, one of the files targeted for that
transaction was deleted between the time it was targeted and the time TSM
started to process it, you could see the same problem you attribute to
current versions of TSM. The larger the transaction, the bigger window of
opportunity to see this condition, especially for active file systems.

Starting with TSM 3.7, the product moved to a multithreaded, multisession
producer/consumer model for backup. Producer threads would scan the file
systems and queue up transactions to be processed (backed up) by consumer
threads. With this model, I suppose it is possible that the "window of
opportunity" I spoke of above is increased somewhat since transactions can
be queued up further in advance of the backup. But it isn't as if we've
changed the product so that TSM makes one pass to examine files, then a
second pass to back them up.

Regarding TSM failing to back files up when it should not, if I understand
correctly, you are saying that TSM should have backed up the file as soon
as it was identified, but because of the delay, the file was deleted so
TSM missed its opportunity. I'm not sure this is a failing on TSM's part
as such, because if the file system is *that* volatile, then even if TSM
worked in the manner you desire, it still might have missed the file if
the scan started a second or two later, i.e. if the file was there at
01:30:06 but not there at 01:30:8, then whether TSM backs this file up --
be it version 2.x, or 5.x -- is really a hit or miss proposition,
depending on the point in time that TSM reaches that spot in the file
system. The only difference is that in version 2.x, this situation was not
brought to your attention. Thus I don't see this as a failure in TSM, but
that with earlier versions you had a "what you don't know doesn't hurt
you" situation.

I don't think there is anything wrong with the suggestion to exclude these
volatile files/directories, but like any advice of this kind, one needs to
consider its applicability in one's environment.

Having said all of that, I am not trying to be dismissive of your
concerns. while I haven't seen very many complaints about this behavior, I
don't discount that in your environment it is annoying. But if we were,
hypothetically, to move to a "scan it/back it up" model, we'd be taking a
major step backwards in performance. But if you want to get closer to that
model, try setting RESOURCEUTILIZATION to 1 and see if it makes a
difference. You can also try MEMORYEFFICIENTBACKUP YES to see if that
helps slow the scan down a bit.

5.2 includes open file support for Windows 2000 and XP, which I think will
eliminate this issue entirely for you (on those Windows platforms).

I'm not sure that any of this is a real help; if you require a more
comprehensive solution, then please consider opening a requirement and/or
pursuing through your IBM representative.

Regards,

Andy

Andy Raibeck
IBM Software Group
Tivoli Storage Manager Client Development
Internal Notes e-mail: Andrew Raibeck/Tucson/IBM@IBMUS
Internet e-mail: storman AT us.ibm DOT com

The only dumb question is the one that goes unasked.
The command line is your friend.
"Good enough" is the enemy of excellence.




Chuck Mattern <Chuck_Mattern AT HOMEDEPOT DOT COM>
Sent by: "ADSM: Dist Stor Manager" <ADSM-L AT VM.MARIST DOT EDU>
06/27/2003 09:24
Please respond to "ADSM: Dist Stor Manager"


        To:     ADSM-L AT VM.MARIST DOT EDU
        cc:
        Subject:        Re: DSMC Exit Code Question



Hi Andy,

I agree that TSM should report a failure when a file identified as a
candidate for backup is not able to be backed up.  What I have a problem
with is that in ADSM v2.x and v3.x the files were backed up as they were
identified.  Once we went to ADSM v4.x and now TSM v5x it seems to have
adopted the dump methodology whereby it will examine a large number of
files (I'm truly not sure if it's hitting a directory at a time, a
filesystem at a time or a certain number of files) then come back to back
them up later.  Since I don't have the opportunity to quiesce my
filesystems I'm guaranteed to have a large number of failures and invest a
good amount of my engineers time in investigating a situation that never
should have occurred.  Essentially my complaint is not that TSM reports
failures, my complaint is that TSM fails to backup files when it should
not.  Like ADSM2, ADSM3, tar, pax, afio et al, it should take the file
when
it is identified, not wait until it's programmatically convenient.
Advising users to simply exclude any files or filesystems where some of
the
files tend to turnover rapidly is not realistic in a large environment of
diverse server configurations managed by a lean staff.

Regards,
Chuck




                      Andrew Raibeck
                      <storman AT US DOT IBM.C        To: ADSM-L AT vm.marist 
DOT edu
                      OM>                      cc:
                      Sent by: "ADSM:          Subject:  Re: DSMC Exit
Code Question
                      Dist Stor
                      Manager"
                      <[email protected]
                      .edu>


                      06/27/2003 01:19
                      AM
                      Please respond to
                      "ADSM: Dist Stor
                      Manager"






There is no way to modify TSM behavior as you describe. In fact, your
statement about excluding the files is exactly one of the reasons why TSM
behaves as it does: so you can evaluate the reasons for the missed files,
and take the necessary actions. We feel that it is better to flag the
backup with an RC 4 and let you decide that you don't need backups for the
particular files, than to flag the backup with an RC 0, only to discover
after it is too late that you really needed the backups (in which case,
you'd probably be asking why TSM didn't warn you that it couldn't back up
the files).

Regards,

Andy

Andy Raibeck
IBM Software Group
Tivoli Storage Manager Client Development
Internal Notes e-mail: Andrew Raibeck/Tucson/IBM@IBMUS
Internet e-mail: storman AT us.ibm DOT com

The only dumb question is the one that goes unasked.
The command line is your friend.
"Good enough" is the enemy of excellence.




Andy Carlson <andyc AT ANDYC.CARENET DOT ORG>
Sent by: "ADSM: Dist Stor Manager" <ADSM-L AT VM.MARIST DOT EDU>
06/26/2003 13:08
Please respond to "ADSM: Dist Stor Manager"


        To:     ADSM-L AT VM.MARIST DOT EDU
        cc:
        Subject:        DSMC Exit Code Question



I had an intersting question posed to me today and I don't have an
answer.  This person is checking the return code on the dsmc command to
determine if the backup needs to be looked at or not.  What he is coming
across is files that are in the directory when the directory is scanned,
but gone when the backup tries to back the file up.  We did figure out
we could probably exclude the files, but what we would like is for TSM
not to give a non-zero return code what a file is not found.  Thanks for
any input into this matter.


Andy Carlson                                    |\      _,,,---,,_
Senior Technical Specialist               ZZZzz /,`.-'`'    -.  ;-;;,_
BJC Health Care                                |,4-  ) )-,_. ,\ (  `'-'
St. Louis, Missouri                           '---''(_/--'  `-'\_)
Cat Pics: http://andyc.dyndns.org/animal.html

<Prev in Thread] Current Thread [Next in Thread>