Re: [Networker] ! No data failure on NetApp F720 filer

2003-03-12 15:48:37
Subject: Re: [Networker] ! No data failure on NetApp F720 filer
From: "Lewis, Terry {Info~Palo Alto}" <TERRY.LEWIS AT ROCHE DOT COM>
To: NETWORKER AT LISTMAIL.TEMPLE DOT EDU
Date: Wed, 12 Mar 2003 12:48:24 -0800
Barney,
 
   A heavily loaded filer may be to blame here.

   If this just started happening, I suggest you check the system
load on the filer.  NetApp and Legato would probably suggest that
you switch to the NDMP client.  In any case, I saw similar
failures when space usage on the filer crossed some unknown
threshold, causing the filer CPU to run busier than it had
before.  Apparently, the increased load slowed backup operations
so much that timeouts and other errors occurred regularly.  For
about 6 months, NetApp backups were spotty.  Now, after some
clean-up, NetApp backups run fine again.
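
   One rough way to check whether slow transfers are involved is to
work out the throughput of each save attempt from the daemon.log
entries quoted below.  The following is only a sketch: the log lines
are unwrapped copies of the ones in the quoted message, and the names
save_rates, LOG, and LINE are made up for illustration, not part of
NetWorker.

```python
import re
from datetime import datetime

# Unwrapped copies of daemon.log lines quoted in this thread.
LOG = """\
03/11/03 03:04:40 AM nsrd: sequoia:/vol/vol0/home saving to pool 'NetApp' (NetApp.136)
03/11/03 03:45:05 AM nsrd: sequoia:/vol/vol0/home done saving to pool 'NetApp' (NetApp.136) 85 MB
03/11/03 03:45:17 AM nsrd: sequoia:/vol/vol0/home saving to pool 'NetApp' (NetApp.136)
03/11/03 04:24:39 AM nsrd: sequoia:/vol/vol0/home done saving to pool 'NetApp' (NetApp.136) 85 MB
03/11/03 04:29:37 AM nsrd: acacia_bkp.aurora.com:index:sequoia saving to pool 'Default' (acacia_bkp.aurora.com.171)
03/11/03 04:36:52 AM nsrd: acacia_bkp.aurora.com:index:sequoia done saving to pool 'Default' (acacia_bkp.aurora.com.171) 3042 MB
"""

# Matches the "saving to pool" / "done saving to pool" lines only.
# This pattern is an assumption based on the excerpt, not a documented format.
LINE = re.compile(
    r"^(?P<ts>\d\d/\d\d/\d\d \d\d:\d\d:\d\d [AP]M) nsrd: "
    r"(?P<saveset>\S+) (?P<done>done )?saving to pool '[^']*' \(\S+\)"
    r"(?: (?P<mb>\d+) MB)?$"
)


def save_rates(text):
    """Return (save set, elapsed seconds, MB written) for each finished save."""
    started = {}   # save set name -> time its current attempt began
    results = []
    for line in text.splitlines():
        m = LINE.match(line.strip())
        if not m:
            continue
        ts = datetime.strptime(m["ts"], "%m/%d/%y %I:%M:%S %p")
        name = m["saveset"]
        if m["done"] is None:
            started[name] = ts
        elif name in started and m["mb"] is not None:
            secs = int((ts - started.pop(name)).total_seconds())
            results.append((name, secs, int(m["mb"])))
    return results


if __name__ == "__main__":
    for name, secs, mb in save_rates(LOG):
        rate = mb * 1024 / secs if secs else float("nan")
        print(f"{name}: {mb} MB in {secs} s ({rate:.0f} KB/s)")
```

   On the quoted excerpt, each attempt at sequoia:/vol/vol0/home moved
85 MB in about 40 minutes, while the index save to the same tape drive
wrote 3042 MB in a little over 7 minutes.  That does not prove the
filer is overloaded, but it suggests the tape side is not what is slow.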

Terry

  

> -----Original Message-----
> From: Barney Mowder [SMTP:bmowder AT AURORA-SYS DOT COM]
> Sent: Wednesday, March 12, 2003 12:29 PM
> To:   NETWORKER AT LISTMAIL.TEMPLE DOT EDU
> Subject:      [Networker] ! No data failure on NetApp F720 filer
> 
> All-
> 
>   We have a problem with NetApp backups running on an F720 filer.
> 
>   We run the NetWorker server on an NT 4.0 box, which calls the Legato
> Business Suite client.  For some reason, it's now failing consistently
> with the following entries in the messages file:
> 
> Mar 11 03:00:03 acacia_bkp.aurora.com: NetWorker Savegroup: (info) starting
> NETAPP (with 1 client(s))
> Mar 11 03:00:32 acacia_bkp.aurora.com: NetWorker media: (info) suggest
> mounting NetApp.136 on acacia_bkp.aurora.com for writing  to pool 'NetApp'
> Mar 11 03:00:34 acacia_bkp.aurora.com: NetWorker media: (waiting) Waiting
> for 1 writable volumes to backup pool 'NetApp' tape(s) on
> acacia_bkp.aurora.com
> Mar 11 03:00:34 acacia_bkp.aurora.com: NetWorker media: (waiting) Waiting
> for 1 writable volumes to backup pool 'NetApp' tape(s) on
> acacia_bkp.aurora.com
> Mar 11 03:02:58 acacia_bkp.aurora.com: NetWorker Media: (info) loading
> volume NetApp.136 into \\.\Tape0
> Mar 11 04:25:59 acacia_bkp.aurora.com: NetWorker media: (info) suggest
> mounting acacia_bkp.aurora.com.171 on acacia_bkp.aurora.com for writing  to
> pool 'Default'
> Mar 11 04:26:01 acacia_bkp.aurora.com: NetWorker media: (waiting) Waiting
> for 1 writable volumes to backup pool 'Default' tape(s) on
> acacia_bkp.aurora.com
> Mar 11 04:26:01 acacia_bkp.aurora.com: NetWorker media: (waiting) Waiting
> for 1 writable volumes to backup pool 'Default' tape(s) on
> acacia_bkp.aurora.com
> Mar 11 04:26:36 acacia_bkp.aurora.com: NetWorker Media: (info) loading
> volume acacia_bkp.aurora.com.171 into \\.\Tape0
> Mar 11 04:37:05 acacia_bkp.aurora.com: NetWorker Savegroup: (alert) NETAPP
> completed, 1 client(s) (sequoia Failed)
> Mar 11 04:37:05 acacia_bkp.aurora.com: Start time:   Tue Mar 11 03:00:03
> 2003
> Mar 11 04:37:05 acacia_bkp.aurora.com: End time:     Tue Mar 11 04:37:05
> 2003
> Mar 11 04:37:05 acacia_bkp.aurora.com: --- Unsuccessful Save Sets ---
> Mar 11 04:37:05 acacia_bkp.aurora.com: * sequoia:/vol/vol0/home 1 retry
> attempted
> Mar 11 04:37:05 acacia_bkp.aurora.com: * sequoia:/vol/vol0/home ! no output
> Mar 11 04:37:05 acacia_bkp.aurora.com: --- Successful Save Sets ---
> Mar 11 04:37:05 acacia_bkp.aurora.com:   acacia_bkp.aurora.com:
> index:sequoia level=9,   3042 MB 00:10:53   2210 files
> 
> And the following entries in the daemon.log file:
> 
> 03/11/03 03:00:03 AM nsrd: savegroup info: starting NETAPP (with 1
> client(s))
> 03/11/03 03:00:32 AM nsrd: media info: suggest mounting NetApp.136 on
> acacia_bkp.aurora.com for writing  to pool 'NetApp'
> 03/11/03 03:00:34 AM nsrd: media waiting event: Waiting for 1 writable
> volumes to backup pool 'NetApp' tape(s) on acacia_bkp.aurora.com
> 03/11/03 03:00:35 AM nsrd: \\.\Tape0 Eject operation in progress
> 03/11/03 03:02:58 AM nsrd: media info: loading volume NetApp.136 into
> \\.\Tape0
> 03/11/03 03:03:10 AM nsrd: \\.\Tape0 Verify label operation in progress
> 03/11/03 03:04:04 AM nsrd: \\.\Tape0 Mount operation in progress
> 03/11/03 03:04:40 AM nsrd: media event cleared: Waiting for 1 writable
> volumes to backup pool 'NetApp' tape(s) on acacia_bkp.aurora.com
> 03/11/03 03:04:40 AM nsrd: sequoia:/vol/vol0/home saving to pool 'NetApp'
> (NetApp.136)
> 03/11/03 03:45:05 AM nsrd: sequoia:/vol/vol0/home done saving to pool
> 'NetApp' (NetApp.136) 85 MB
> 03/11/03 03:45:13 AM savegrp: sequoia:/vol/vol0/home unexpectedly exited.
> 03/11/03 03:45:13 AM savegrp: sequoia:/vol/vol0/home will retry 1 more
> time(s)
> 03/11/03 03:45:17 AM nsrd: sequoia:/vol/vol0/home saving to pool 'NetApp'
> (NetApp.136)
> 03/11/03 04:24:39 AM nsrd: sequoia:/vol/vol0/home done saving to pool
> 'NetApp' (NetApp.136) 85 MB
> 03/11/03 04:24:56 AM savegrp: sequoia:/vol/vol0/home unexpectedly exited.
> 03/11/03 04:24:56 AM savegrp: sequoia:/vol/vol0/home will retry 0 more
> time(s)
> 03/11/03 04:25:50 AM nsrd: write completion notice: Writing to volume
> NetApp.136 complete
> 03/11/03 04:25:59 AM nsrd: media info: suggest mounting
> acacia_bkp.aurora.com.171 on acacia_bkp.aurora.com for writing  to pool
> 'Default'
> 03/11/03 04:26:01 AM nsrd: media waiting event: Waiting for 1 writable
> volumes to backup pool 'Default' tape(s) on acacia_bkp.aurora.com
> 03/11/03 04:26:02 AM nsrd: \\.\Tape0 Eject operation in progress
> 03/11/03 04:26:36 AM nsrd: media info: loading volume
> acacia_bkp.aurora.com.171 into \\.\Tape0
> 03/11/03 04:26:48 AM nsrd: \\.\Tape0 Verify label operation in progress
> 03/11/03 04:27:34 AM nsrd: \\.\Tape0 Mount operation in progress
> 03/11/03 04:29:37 AM nsrd: media event cleared: Waiting for 1 writable
> volumes to backup pool 'Default' tape(s) on acacia_bkp.aurora.com
> 03/11/03 04:29:37 AM nsrd: acacia_bkp.aurora.com:index:sequoia saving to
> pool 'Default' (acacia_bkp.aurora.com.171)
> 03/11/03 04:36:52 AM nsrd: acacia_bkp.aurora.com:index:sequoia done saving
> to pool 'Default' (acacia_bkp.aurora.com.171) 3042 MB
> 03/11/03 04:37:05 AM nsrd: savegroup alert: NETAPP completed, 1 client(s)
> (sequoia Failed)
> * sequoia:/vol/vol0/home ! no output
> * sequoia:/vol/vol0/home 1 retry attempted
> * sequoia:/vol/vol0/home ! no output
> 03/11/03 04:37:05 AM nsrd: runq: NSR group NETAPP exited with return code 1.
> 03/11/03 04:37:54 AM nsrd: write completion notice: Writing to volume
> acacia_bkp.aurora.com.171 complete
> 
> 
> 
>   Anybody else had to deal with this?  What's a good place to start?  What
> sorts of things can cause this failure?
> 
> --
> Note: To sign off this list, send a "signoff networker" command via email
> to listserv AT listmail.temple DOT edu or visit the list's Web site at
> http://listmail.temple.edu/archives/networker.html where you can
> also view and post messages to the list.
> =*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=

