On 2006-05-30 10:30, Paul Duncan wrote:
Hello,
One of our filesystems is failing to get backed up and I am interested
in trying to ascertain why. The report entry is:
compaqdev2 /export/home lev 0 FAILED [data timeout]
In the amdump file I see the following suspicious entries. I get a
series of "driver-idle: no-diskspace" entries which span 3 hours:
driver: state time 1737.214 free kps: 254081 space: 2164824 taper: idle
idle-dumpers: 1 qlen tapeq: 0 runq: 26 roomq: 0 wakeup: 15 driver-idle:
no-diskspace
driver: state time 12480.205 free kps: 286720 space: 15482922 taper:
writing idle-dumpers: 5 qlen tapeq: 6 runq: 2 roomq: 0 wakeup: 86400
driver-idle: no-diskspace
The above means that Amanda did not start up another dumper because
the holdingdisk had all space reserved by other dumpers.
When a dumper needs more diskspace on the holdingdisk than it reserved
in the beginning, it asks driver with a command "RQ-MORE-DISK"? Do you
see that string in the logfile?
Then the filesystem dump fails over an hour later:
driver: result time 17127.984 from dumper0: FAILED 01-00054 [data timeout]
I would have a look in the sendbackup.*.debug file on the client and
see if some warning/error message is in there, and verify that the
client was still running at that time.
Could this also be just another symptom of the problem described here:
http://wiki.zmanda.com/index.php/Amdump:_mesg_read:_Connection_reset_by_peer
Why does the data timeout occur over an hour after disk space becomes
available? The dtimeout parameter in amanda.conf is set to 1800.
I think it is because the datatimeout is not triggered by the diskspace
but by something else.
--
Paul Bijnens, xplanation Technology Services Tel +32 16 397.511
Technologielaan 21 bus 2, B-3001 Leuven, BELGIUM Fax +32 16 397.512
http://www.xplanation.com/ email: Paul.Bijnens AT xplanation DOT com
***********************************************************************
* I think I've got the hang of it now: exit, ^D, ^C, ^\, ^Z, ^Q, ^^, *
* F6, quit, ZZ, :q, :q!, M-Z, ^X^C, logoff, logout, close, bye, /bye, *
* stop, end, F3, ~., ^]c, +++ ATH, disconnect, halt, abort, hangup, *
* PF4, F20, ^X^X, :D::D, KJOB, F14-f-e, F8-e, kill -1 $$, shutdown, *
* init 0, kill -9 1, Alt-F4, Ctrl-Alt-Del, AltGr-NumLock, Stop-A, ... *
* ... "Are you sure?" ... YES ... Phew ... I'm out *
***********************************************************************
|