Amanda-Users

Re: Data Timeout

2006-05-31 10:59:43
Subject: Re: Data Timeout
From: Paul Bijnens <paul.bijnens AT xplanation DOT com>
To: Paul Duncan <Paul.Duncan AT yolus DOT com>
Date: Wed, 31 May 2006 16:53:27 +0200
On 2006-05-30 10:30, Paul Duncan wrote:
Hello,
One of our filesystems is failing to get backed up and I am interested in trying to ascertain why. The report entry is: compaqdev2 /export/home lev 0 FAILED [data timeout]

In the amdump file I see the following suspicious entries. I get a series of "driver-idle: no-diskspace" entries which span 3 hours:

driver: state time 1737.214 free kps: 254081 space: 2164824 taper: idle idle-dumpers: 1 qlen tapeq: 0 runq: 26 roomq: 0 wakeup: 15 driver-idle: no-diskspace

driver: state time 12480.205 free kps: 286720 space: 15482922 taper: writing idle-dumpers: 5 qlen tapeq: 6 runq: 2 roomq: 0 wakeup: 86400 driver-idle: no-diskspace

The above means that Amanda did not start up another dumper because
all the space on the holdingdisk was already reserved by other dumpers.
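
To measure how long the driver really sat in that state, the state lines can be pulled out of the amdump file with a short script. This is only a sketch in Python: the regular expression is modelled on the two lines quoted above, and the function names (parse_state, no_diskspace_span) are made up for illustration and are not part of Amanda.

```python
import re

# Pattern modelled on the two "driver: state" lines quoted in this thread.
# Other Amanda versions may print the fields differently (assumption).
STATE_RE = re.compile(
    r"driver: state time (?P<time>\d+(?:\.\d+)?) .*?"
    r"space: (?P<space>\d+) .*?"
    r"idle-dumpers: (?P<idle>\d+) .*?"
    r"driver-idle: (?P<reason>\S+)"
)

# The two lines from the original report.
SAMPLE = [
    "driver: state time 1737.214 free kps: 254081 space: 2164824 "
    "taper: idle idle-dumpers: 1 qlen tapeq: 0 runq: 26 roomq: 0 "
    "wakeup: 15 driver-idle: no-diskspace",
    "driver: state time 12480.205 free kps: 286720 space: 15482922 "
    "taper: writing idle-dumpers: 5 qlen tapeq: 6 runq: 2 roomq: 0 "
    "wakeup: 86400 driver-idle: no-diskspace",
]


def parse_state(line):
    """Return the interesting fields of one state line, or None if it does not match."""
    m = STATE_RE.search(line)
    if m is None:
        return None
    return {
        "time": float(m.group("time")),
        "space": int(m.group("space")),
        "idle_dumpers": int(m.group("idle")),
        "reason": m.group("reason"),
    }


def no_diskspace_span(lines):
    """Return (first, last) timestamps of the no-diskspace entries, or None."""
    times = []
    for line in lines:
        state = parse_state(line)
        if state is not None and state["reason"] == "no-diskspace":
            times.append(state["time"])
    if not times:
        return None
    return min(times), max(times)


if __name__ == "__main__":
    first, last = no_diskspace_span(SAMPLE)
    print(f"no-diskspace from {first} to {last}: {last - first:.0f} seconds")
```

For the two quoted lines the span comes out at roughly 10743 seconds, just under 3 hours, which matches the report. To run it over a whole amdump file, pass the open file object to no_diskspace_span instead of SAMPLE.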

When a dumper needs more diskspace on the holdingdisk than it reserved
in the beginning, it asks the driver for it with an "RQ-MORE-DISK"
command. Do you see that string in the logfile?


Then the filesystem dump fails over an hour later:

driver: result time 17127.984 from dumper0: FAILED 01-00054 [data timeout]

I would have a look in the sendbackup.*.debug file on the client and
see if some warning/error message is in there, and verify that the
client was still running at that time.
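
If there are many debug files on the client, a small script can do the first pass. The following is a rough sketch in Python, with assumptions: the directory holding the sendbackup.*.debug files depends on how Amanda was built, so it is left as a parameter, and matching on the words "warning" and "error" is only a guess at what a problem looks like, not a list of real Amanda messages. The function names are made up for illustration.

```python
import re
from pathlib import Path

# Heuristic only: actual messages in the debug files may be worded differently.
SUSPICIOUS = re.compile(r"warning|error", re.IGNORECASE)


def suspicious_lines(lines):
    """Return (line_number, text) for every line that mentions a warning or error."""
    return [
        (number, line.rstrip("\n"))
        for number, line in enumerate(lines, 1)
        if SUSPICIOUS.search(line)
    ]


def scan_debug_dir(debug_dir):
    """Scan every sendbackup.*.debug file in debug_dir; return {file name: hits}."""
    hits = {}
    for path in sorted(Path(debug_dir).glob("sendbackup.*.debug")):
        with open(path, errors="replace") as handle:
            found = suspicious_lines(handle)
        if found:
            hits[path.name] = found
    return hits
```

Call scan_debug_dir with the client's debug directory and read the reported lines in context; a file with no hits still needs a look at its last lines to see whether sendbackup finished at all.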

Could this also be just another symptom of the problem described here:

http://wiki.zmanda.com/index.php/Amdump:_mesg_read:_Connection_reset_by_peer


Why does the data timeout occur over an hour after disk space becomes available? The dtimeout parameter in amanda.conf is set to 1800.

I think it is because the data timeout is not triggered by the diskspace
but by something else.
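
The numbers in the report support this. Below is a small worked calculation in Python, under two assumptions: the "time" values in the amdump file are seconds since the start of the run, and the dtimeout clock restarts every time data arrives from the client. The function name timeout_timeline is made up for illustration.

```python
def timeout_timeline(failed_at, last_no_diskspace, dtimeout):
    """Return (gap, last_data_estimate) in seconds.

    gap: time between the last no-diskspace entry and the failure.
    last_data_estimate: latest moment data could have arrived, assuming the
    failure fired exactly dtimeout seconds after the last data was received.
    """
    gap = failed_at - last_no_diskspace
    last_data_estimate = failed_at - dtimeout
    return gap, last_data_estimate


if __name__ == "__main__":
    # Values taken from the report: failure time, last quoted
    # no-diskspace entry, and dtimeout from amanda.conf.
    failed_at = 17127.984
    last_no_diskspace = 12480.205
    dtimeout = 1800.0

    gap, last_data = timeout_timeline(failed_at, last_no_diskspace, dtimeout)
    print(f"gap between no-diskspace and failure: {gap / 60:.0f} min")
    print(f"estimated last data received at: {last_data:.3f}")
    print(f"that is {(last_data - last_no_diskspace) / 60:.0f} min after the last no-diskspace entry")
```

The gap is about 77 minutes, far more than the 30 minutes of dtimeout. Under the assumptions above, data was still arriving until about 47 minutes after the last no-diskspace entry, so the dump stalled well after the holdingdisk shortage was over.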


--
Paul Bijnens, xplanation Technology Services        Tel  +32 16 397.511
Technologielaan 21 bus 2, B-3001 Leuven, BELGIUM    Fax  +32 16 397.512
http://www.xplanation.com/          email:  Paul.Bijnens AT xplanation DOT com
***********************************************************************
* I think I've got the hang of it now:  exit, ^D, ^C, ^\, ^Z, ^Q, ^^, *
* F6, quit, ZZ, :q, :q!, M-Z, ^X^C, logoff, logout, close, bye, /bye, *
* stop, end, F3, ~., ^]c, +++ ATH, disconnect, halt,  abort,  hangup, *
* PF4, F20, ^X^X, :D::D, KJOB, F14-f-e, F8-e,  kill -1 $$,  shutdown, *
* init 0, kill -9 1, Alt-F4, Ctrl-Alt-Del, AltGr-NumLock, Stop-A, ... *
* ...  "Are you sure?"  ...   YES   ...   Phew ...   I'm out          *
***********************************************************************

