Amanda-Users

Re: got FAILED for no apparent reason

2005-07-12 09:30:43
Subject: Re: got FAILED for no apparent reason
From: Jon LaBadie <jon AT jgcomp DOT com>
To: amanda-users AT amanda DOT org
Date: Tue, 12 Jul 2005 09:16:03 -0400
On Tue, Jul 12, 2005 at 01:50:44PM +0200, Paul Bijnens wrote:
> Rodrigo Ventura wrote:
> >Hello. My backup system has been running smoothly for the last few
> >months. However, today I got these FAILED dumps:
> >
> >
> >FAILURE AND STRANGE DUMP SUMMARY:
> >  gtisr      /usr lev 0 FAILED [Estimate timeout from gtisr]
> >  gtisr      /boot lev 0 FAILED [Estimate timeout from gtisr]
> >  gtisr      / lev 0 FAILED [Estimate timeout from gtisr]
> >[...]
> >
> >The debug logs in /tmp/amanda (on the gtisr host) show no errors, but their
> >presence shows that the server and client are communicating.
> >
> >Any suggestions on where I should look to debug this occurrence?
> 
> Could it be that the estimate took too long and the amanda
> server stopped waiting for a response from that client?
> What is etimeout in amanda.conf?
> If not specified, the default etimeout is 600 sec times the number of DLEs
> for that host.

My sources (now a bit old) show a default of 300 sec.  Was that raised?
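
Either way, the arithmetic is easy to check. A minimal sketch, assuming
etimeout is a per-DLE value multiplied by the host's DLE count as described
above. The function name is made up for illustration, and the count of 3
only covers the DLEs visible in the report (the "[...]" may hide more):

```python
def estimate_wait_seconds(etimeout, dle_count):
    """Total time the server waits for estimates from one host.

    Assumes the rule quoted above: a positive etimeout is a per-DLE
    value, multiplied by the number of DLEs configured for the host.
    """
    if etimeout <= 0 or dle_count <= 0:
        raise ValueError("this sketch only handles positive values")
    return etimeout * dle_count


# The report lists at least three DLEs on gtisr: /usr, /boot and /
for default in (300, 600):
    print(default, "->", estimate_wait_seconds(default, 3), "seconds")
```

With three DLEs that is a 15-minute wait for a 300 sec default and a
30-minute wait for a 600 sec default.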


> How long did the estimate really take?   Have a look at the
> timestamps in the amandad.*debug files where the sendsize is handled.
> 
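
To compare the real estimate duration against the timeout, the start and
finish timestamps can simply be subtracted. A minimal sketch, assuming the
debug file prints ctime-style timestamps such as "Tue Jul 12 01:45:03 2005"
(that is an assumption about the log format; adjust the format string if
your files differ). The sample values are hypothetical, not taken from the
poster's logs:

```python
from datetime import datetime

# Assumed timestamp layout, e.g. "Tue Jul 12 01:45:03 2005"
CTIME_FORMAT = "%a %b %d %H:%M:%S %Y"


def elapsed_seconds(start, finish):
    """Seconds between two ctime-style timestamps copied from a debug file."""
    t0 = datetime.strptime(start, CTIME_FORMAT)
    t1 = datetime.strptime(finish, CTIME_FORMAT)
    return (t1 - t0).total_seconds()


# Hypothetical start and finish times for the sendsize request.
took = elapsed_seconds("Tue Jul 12 01:45:03 2005", "Tue Jul 12 02:20:03 2005")
print(took, "seconds")
```

If the result is larger than the effective estimate timeout, the server
gave up before the client finished.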
> A timeout could also happen when an intermediate firewall (or a firewall
> on the client and/or server itself) drops the UDP reply packet for the
> sendsize request.  The default expiration of the firewall's UDP state entry
> is usually 300 seconds (for iptables) or even 40 seconds (for FW1).
> 
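
The firewall scenario comes down to one comparison: did the estimate run
longer than the firewall remembers the UDP exchange? A minimal sketch using
the two timeouts quoted above; the function name and the 420-second
estimate are hypothetical:

```python
def reply_outlives_firewall(estimate_seconds, udp_state_timeout):
    """True if the estimate ran longer than the firewall keeps UDP state.

    In that case the reply packet would arrive after the state entry
    expired, and a stateful firewall would likely drop it.
    """
    return estimate_seconds > udp_state_timeout


# Timeouts as quoted above; the 420-second estimate is hypothetical.
for name, timeout in (("iptables", 300), ("FW1", 40)):
    print(name, reply_outlives_firewall(420, timeout))
```

Note that a reply can be lost this way even when the estimate finishes
well inside etimeout, which is why both numbers are worth checking.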
> 
> -- 
> Paul Bijnens, Xplanation                            Tel  +32 16 397.511
> Technologielaan 21 bus 2, B-3001 Leuven, BELGIUM    Fax  +32 16 397.512
> http://www.xplanation.com/          email:  Paul.Bijnens AT xplanation DOT com
> ***********************************************************************
> * I think I've got the hang of it now:  exit, ^D, ^C, ^\, ^Z, ^Q, F6, *
> * quit,  ZZ, :q, :q!,  M-Z, ^X^C,  logoff, logout, close, bye,  /bye, *
> * stop, end, F3, ~., ^]c, +++ ATH, disconnect, halt,  abort,  hangup, *
> * PF4, F20, ^X^X, :D::D, KJOB, F14-f-e, F8-e,  kill -1 $$,  shutdown, *
> * init 0, kill -9 1, Alt-F4, Ctrl-Alt-Del, AltGr-NumLock, Stop-A, ... *
> * ...  "Are you sure?"  ...   YES   ...   Phew ...   I'm out          *
> ***********************************************************************
> 
> 
> 
>>> End of included message <<<

-- 
Jon H. LaBadie                  jon AT jgcomp DOT com
 JG Computing
 4455 Province Line Road        (609) 252-0159
 Princeton, NJ  08540-4322      (609) 683-7220 (fax)
