ADSM-L

Re: restore hangs

2002-10-28 09:06:28
Subject: Re: restore hangs
From: Alexander Lazarevich <alazarev AT HERA.ITG.UIUC DOT EDU>
To: ADSM-L AT VM.MARIST DOT EDU
Date: Mon, 28 Oct 2002 08:02:53 -0600
hmm, im not sure what you are saying. on the client i run dsm. thats it.
that starts the dsm client, and i can backup or restore. maybe thats
called an incremental restore? its NOT on a schedule. im trying to do an
unscheduled restore.


heres exactly what happened:
i ran dsm on the unix client, started restoring, then the library came
accross a bad tape that it couldnt fix, and then it just stoped restoring,
it didnt kill the connection, it just stoped. when i came back the next
morning, the error message i saw on the client terminal was "data
unavailable to the server". i assume that means either 1) a tape in the
library is bad (i did see some real bad server messages errors for tape
16644F) 2) the previous sys admin had tapes defined in the library that
are NOT physically in the library, and i have no idea where they are, and
whereever they are i need to tell the server to forget about them. i
believe 1) is what happened, because i dont see any server message about a
missing tape, just the message about bad VCR on tape.

so my question is how do i tell the restore process to forget about
whatever its stopping on, and just continue the restore process.

thanks for your help though i really appreciate it!

alex
---                                                        ---
   Alex Lazarevich | Systems | Imaging Technology Group
   alazarev AT itg.uiuc DOT edu | (217)244-1565 | www.itg.uiuc.edu
---                                                        ---

On Sun, 27 Oct 2002, DFrance wrote:

> I assume you mean the "dsmc sched" process on the (Unix) client;  that is
> the daemon used for running scheduled commands... unless you are defining
> schedule on the server for doing the restore, it's not relevant (right?!).
>
> OTOH, it sounds like maybe you *are* running the restore using a client
> schedule;  for restartable (and most any other total-filesystem) restores, I
> would advise running command-line client from interactive connection to the
> client machine -- redirect output for logging purposes, if desired, but do
> keep the session from an interactive terminal connection.
>
>
> Don France
> Technical Architect -- Tivoli Certified Consultant
> Tivoli Storage Manager, WinNT/2K, AIX/Unix, OS/390
> San Jose, Ca
> (408) 257-3037
> mailto:don_france AT att DOT net
>
> Professional Association of Contract Employees
> (P.A.C.E. -- www.pacepros.com)
>
>
>
> -----Original Message-----
> From: ADSM: Dist Stor Manager [mailto:ADSM-L AT VM.MARIST DOT EDU]On Behalf Of
> Alexander Lazarevich
> Sent: Sunday, October 27, 2002 12:20 PM
> To: ADSM-L AT VM.MARIST DOT EDU
> Subject: Re: restore hangs
>
>
> totally cool, thaks don. the restartable process is there!
>
> one question though. the dsm client stoped and i killed that process on
> the client. do i need to restart dsm on the clinet before i restart the
> process on the server? i praobably do, im just asking in case.
>
> alex
>
> On Sun, 27 Oct 2002, DFrance wrote:
>
> > Tapes marked unavailable usually just need to be marked available (and
> > possibly checked back into the library).  I suggest setting it to
> readonly,
> > for now.
> >
> > If the msg you got is "waiting for files from the server..." that just
> > indicates no-query-restore is invoked, the client will be receiving all
> the
> > data and is responsible to filter out data it already has, based on your
> > specifications.  You probably have a restartable restore in the queue;  do
> a
> > "q rest" to see; if so, just run the restartable restore -- use the cmd
> > "restart restore" to resume where you left off.  (This was new in 3.1, and
> > it worked fine after the first PTF was installed.)
> >
> > Tapes can be marked unavailable if they are out of the library when the
> > system needs them, and the operator fails to respond to the reply-request
> to
> > check-in the tape;  also, sometimes a tape mount error can cause it to get
> > marked unavailable -- prevents it from being written.
> >
> > Don France
> > Technical Architect -- Tivoli Certified Consultant
> > Tivoli Storage Manager, WinNT/2K, AIX/Unix, OS/390
> > San Jose, Ca
> > (408) 257-3037
> > mailto:don_france AT att DOT net
> >
> > Professional Association of Contract Employees
> > (P.A.C.E. -- www.pacepros.com)
> >
> >
> >
> > -----Original Message-----
> > From: ADSM: Dist Stor Manager [mailto:ADSM-L AT VM.MARIST DOT EDU]On Behalf 
> > Of
> > Alexander Lazarevich
> > Sent: Sunday, October 27, 2002 9:37 AM
> > To: ADSM-L AT VM.MARIST DOT EDU
> > Subject: restore hangs
> >
> >
> > we are in major trouble.
> >
> > one of our fileservers is an AIX 4.3.3 running on a f50, with external
> > 80-pin SCSI drives in a tower attached to a SCSI card in the f50.
> >
> > last friday, the SCSI card for that tower failed, and all of the
> > filesystems on that tower are screwed! im so pissed. i can't mount any of
> > the filesystems, it just says I/O error, and i run an fsck, but that
> > doesnt fix it. i think the whole super block is totally lost.
> >
> > so, im trying to restore all those drives from tape. we have a 3575 tape
> > library and run adsmserv 3.1 on that same AIX machine. there are 8
> > filesystems that i need to restore. i started with one last night, it was
> > going fine, but when i woke up this morning, it had stopped about 1/2 way
> > through and complained about data unavailable to server. it only restored
> > about 7GB when i know there is about 18GB on that filesystem.
> >
> > i tried running the dsm client again, and chose not to overwrite any data,
> > but as soon as i start it, it just hangs there, doing nothing. maybe its
> > still comparing all the 7GB of data that is already restored, but it
> > doesnt say anything about that, it just says transfering...
> >
> > is this normal, or is something not working?
> >
> > also, i looked at the tapes in the library, and there are 2 marked as
> > unavailable. those tapes were not unavailable 2 days ago!! what happened?
> >
> > any ideas?
> >
> > man, if i cant restore this data, we are so screwed.
> >
> > alex
> > ---                                                        ---
> >    Alex Lazarevich | Systems | Imaging Technology Group
> >    alazarev AT itg.uiuc DOT edu | (217)244-1565 | www.itg.uiuc.edu
> > ---                                                        ---
> >
>

<Prev in Thread] Current Thread [Next in Thread>