Amanda-Users

Re: Fw: Intermittent timeouts

2007-12-12 10:44:59
Subject: Re: Fw: Intermittent timeouts
From: Paul Bijnens <Paul.Bijnens AT xplanation DOT com>
To: "amanda-users >> Amanda List" <amanda-users AT amanda DOT org>
Date: Wed, 12 Dec 2007 16:36:33 +0100

The debug you sent is not enough.
There should a debug file for each request from the server.
The one you sent is the first request, a "noop", which the server
uses to learn the capabilities of the client.
The next request should be a "sendsize" request packet.
Show that one.


On 2007-12-12 13:36, Keith Edmunds wrote:
Anyone have any suggestions to help move this forward? Thanks!

Begin forwarded message:

Date: Thu, 6 Dec 2007 15:44:10 +0000
From: Keith Edmunds <kae AT midnighthax DOT com>
To: amanda-users AT amanda DOT org
Subject: Intermittent timeouts


Hi all

We're backing up ten hosts using Amanda; one of them intermittently gets
timeouts while all others are OK.

The errors look like:

[in the log file]
FAIL planner bastion /dev/da2s1c 20071205 0 [Request to bastion timed out.]

[in the amdump file]
planner: time 30.231: error result for host bastion disk /dev/da2s1c:
Request to bastion timed out.

There are six partitions being backed up on the system in question; if I
comment out one or more the disklist entries, the backups run
successfully. They've also run successfully in the past on all six
partitions (this problem started a couple of months ago).

The last part of the debug log on the client server looks like this:

--------------------------------------------------------------------------------
amandad: time 0.028: got packet:
--------
Amanda 2.4 REQ HANDLE 002-E83A0280 SEQ 1196885402
SECURITY USER amanda
SERVICE noop
OPTIONS features=fffffeff9ffe0f;
--------

amandad: time 0.029: sending ack:
----
Amanda 2.4 ACK HANDLE 002-E83A0280 SEQ 1196885402
----

amandad: time 0.030: bsd security: remote host fermi user amanda local
user operator amandad: time 0.033: amandahosts security check passed
amandad: time 0.033: running service "noop"
amandad: time 0.033: sending REP packet:
----
Amanda 2.4 REP HANDLE 002-E83A0280 SEQ 1196885402
OPTIONS features=fffffeff9ffe0f;
----

amandad: time 0.034: got packet:
----
Amanda 2.4 ACK HANDLE 002-E83A0280 SEQ 1196885402
----

amandad: time 0.034: pid 89078 finish time Wed Dec  5 20:10:00 2007
--------------------------------------------------------------------------------

I've increased etimeout from 1800 to 3600 but there's been no improvement.

I'm looking for any ideas to either resolve this problem or to gather more
information to find the cause.

Thanks,
Keith




--
Paul Bijnens, xplanation Technology Services        Tel  +32 16 397.511
Technologielaan 21 bus 2, B-3001 Leuven, BELGIUM    Fax  +32 16 397.512
http://www.xplanation.com/          email:  Paul.Bijnens AT xplanation DOT com
***********************************************************************
* I think I've got the hang of it now:  exit, ^D, ^C, ^\, ^Z, ^Q, ^^, *
* F6, quit, ZZ, :q, :q!, M-Z, ^X^C, logoff, logout, close, bye, /bye, *
* stop, end, F3, ~., ^]c, +++ ATH, disconnect, halt,  abort,  hangup, *
* PF4, F20, ^X^X, :D::D, KJOB, F14-f-e, F8-e,  kill -1 $$,  shutdown, *
* init 0, kill -9 1, Alt-F4, Ctrl-Alt-Del, AltGr-NumLock, Stop-A, ... *
* ...  "Are you sure?"  ...   YES   ...   Phew ...   I'm out          *
***********************************************************************

<Prev in Thread] Current Thread [Next in Thread>