Amanda-Users

Estimate timeout

2005-09-20 05:16:45
Subject: Estimate timeout
From: "Tommy Eriksen" <te AT rackhosting DOT com>
To: <amanda-users AT amanda DOT org>
Date: Tue, 20 Sep 2005 10:55:18 +0200
Hey,

I have a rather strange problem.
I had to restore a complete server from backup recently, no problem, everything 
went smoothly.
However, after this, I can't seem to get a fresh backup of it. I've tried 
everything from reinstalling amanda to changing the machine's hostname (both 
the machines real hostname and the one in amanda), but still I get this in my 
daily report:
  dc104      ar0s1a lev 0 FAILED [Estimate timeout from dc104]

I've got 115 entries in my disklist (on some 60 hosts) and this is the only one 
I can't get to work.
There doesn't seem to be any network problems between the amanda server and the 
client either.

This does look like a networking problem, but the machines can communicate 
freely and without any problems.

In amandad.debug on the client I get:
-bash-2.05b# cat amandad.20050914010102000.debug
amandad: debug 1 pid 20112 ruid 2 euid 2: start at Wed Sep 14 01:01:02 2005
amandad: version 2.4.5
amandad: build: VERSION="Amanda-2.4.5"
amandad:        BUILT_DATE="Thu Aug 25 17:46:51 CEST 2005"
amandad:        BUILT_MACH="FreeBSD tlnordic.moduleweb.net 4.9-STABLE FreeBSD 
4.9-STABLE #0: Mon Jan 5 23:35:10 CET 2004 root AT tlnordic.moduleweb DOT 
net:/usr/obj/usr/src/sys/GENERIC i386"
amandad:        CC="cc"
amandad:        CONFIGURE_COMMAND="'./configure' 
'--libexecdir=/usr/local/libexec/amanda' '--with-amandahosts' '--with-fqdn' 
'--with-dump-honor-nodump' '--with-buffered-dump' '--without-server' 
'--disable-libtool' '--prefix=/usr/local' '--with-user=operator' 
'--with-group=operator' 
'--with-gnutar-listdir=/usr/local/var/amanda/gnutar-lists' 
'--with-index-server=eclipse.rackhosting.com' 
'--with-tape-server=eclipse.rackhosting.com' '--with-config=ModuleWeb' 
'--prefix=/usr/local' '--build=i386-portbld-freebsd4.9'"
amandad: paths: bindir="/usr/local/bin" sbindir="/usr/local/sbin"
amandad:        libexecdir="/usr/local/libexec/amanda"
amandad:        mandir="/usr/local/man" AMANDA_TMPDIR="/tmp/amanda"
amandad:        AMANDA_DBGDIR="/tmp/amanda"
amandad:        CONFIG_DIR="/usr/local/etc/amanda" DEV_PREFIX="/dev/"
amandad:        RDEV_PREFIX="/dev/r" DUMP="/sbin/dump"
amandad:        RESTORE="/sbin/restore" VDUMP=UNDEF VRESTORE=UNDEF
amandad:        XFSDUMP=UNDEF XFSRESTORE=UNDEF VXDUMP=UNDEF VXRESTORE=UNDEF
amandad:        SAMBA_CLIENT=UNDEF GNUTAR="/usr/bin/tar"
amandad:        COMPRESS_PATH="/usr/bin/gzip"
amandad:        UNCOMPRESS_PATH="/usr/bin/gzip" LPRCMD="/usr/bin/lpr"
amandad:        MAILER="/usr/bin/Mail"
amandad:        listed_incr_dir="/usr/local/var/amanda/gnutar-lists"
amandad: defs:  DEFAULT_SERVER="eclipse.rackhosting.com"
amandad:        DEFAULT_CONFIG="ModuleWeb"
amandad:        DEFAULT_TAPE_SERVER="eclipse.rackhosting.com"
amandad:        DEFAULT_TAPE_DEVICE="/dev/null" HAVE_MMAP HAVE_SYSVSHM
amandad:        LOCKING=POSIX_FCNTL DEBUG_CODE AMANDA_DEBUG_DAYS=4
amandad:        BSD_SECURITY USE_AMANDAHOSTS CLIENT_LOGIN="operator"
amandad:        FORCE_USERID HAVE_GZIP COMPRESS_SUFFIX=".gz"
amandad:        COMPRESS_FAST_OPT="--fast" COMPRESS_BEST_OPT="--best"
amandad:        UNCOMPRESS_OPT="-dc"
amandad: time 0.000: got packet:
--------
Amanda 2.4 REQ HANDLE 00F-80930508 SEQ 1126652514
SECURITY USER operator
SERVICE sendsize
OPTIONS features=fffffeff9ffe0f;maxdumps=1;hostname=dc104;
DUMP ar0s1a 0 1970:1:1:0:0:0 -1 OPTIONS |;auth=bsd;compress-fast;index;
DUMP ar0s1a 1 2005:8:10:12:30:11 -1 OPTIONS |;auth=bsd;compress-fast;index;
--------

amandad: time 0.000: sending ack:
----
Amanda 2.4 ACK HANDLE 00F-80930508 SEQ 1126652514
----

amandad: time 0.001: bsd security: remote host eclipse.rackhosting.com user 
operator local user operator
amandad: time 0.001: amandahosts security check passed
amandad: time 0.001: running service "/usr/local/libexec/amanda/sendsize"
amandad: time 599.378: got packet:
----
Amanda 2.4 REQ HANDLE 00F-80930508 SEQ 1126652514
SECURITY USER operator
SERVICE sendsize
OPTIONS features=fffffeff9ffe0f;maxdumps=1;hostname=dc104;
DUMP ar0s1a 0 1970:1:1:0:0:0 -1 OPTIONS |;auth=bsd;compress-fast;index;
DUMP ar0s1a 1 2005:8:10:12:30:11 -1 OPTIONS |;auth=bsd;compress-fast;index;
----

amandad: time 599.378: received dup P_REQ packet, ACKing it
amandad: time 599.378: sending ack:
----
Amanda 2.4 ACK HANDLE 00F-80930508 SEQ 1126652514
----

amandad: time 1199.936: got packet:
----
Amanda 2.4 REQ HANDLE 00F-80930508 SEQ 1126652514
SECURITY USER operator
SERVICE sendsize
OPTIONS features=fffffeff9ffe0f;maxdumps=1;hostname=dc104;
DUMP ar0s1a 0 1970:1:1:0:0:0 -1 OPTIONS |;auth=bsd;compress-fast;index;
DUMP ar0s1a 1 2005:8:10:12:30:11 -1 OPTIONS |;auth=bsd;compress-fast;index;
----

amandad: time 1199.936: received dup P_REQ packet, ACKing it
amandad: time 1199.936: sending ack:
----
Amanda 2.4 ACK HANDLE 00F-80930508 SEQ 1126652514
----

amandad: time 2951.811: sending REP packet:
----
Amanda 2.4 REP HANDLE 00F-80930508 SEQ 1126652514
OPTIONS features=fffffeff9ffe7f;
ar0s1a 0 SIZE 30567356
ar0s1a 1 SIZE 17959269
----

amandad: time 2961.819: dgram_recv: timeout after 10 seconds
amandad: time 2961.819: waiting for ack: timeout, retrying
amandad: time 2971.827: dgram_recv: timeout after 10 seconds
amandad: time 2971.827: waiting for ack: timeout, retrying
amandad: time 2981.835: dgram_recv: timeout after 10 seconds
amandad: time 2981.835: waiting for ack: timeout, retrying
amandad: time 2991.843: dgram_recv: timeout after 10 seconds
amandad: time 2991.843: waiting for ack: timeout, retrying
amandad: time 3001.851: dgram_recv: timeout after 10 seconds
amandad: time 3001.851: waiting for ack: timeout, giving up!
amandad: time 3001.851: pid 20112 finish time Wed Sep 14 01:51:04 2005

In sendsize.debug:
sendsize: debug 1 pid 20113 ruid 2 euid 2: start at Wed Sep 14 01:01:02 2005
sendsize: version 2.4.5
sendsize[20113]: time 0.204: waiting for any estimate child: 1 running
sendsize[20121]: time 0.204: calculating for amname 'ar0s1a', dirname '/', 
spindle -1
sendsize[20121]: time 0.204: getting size via dump for ar0s1a level 0
sendsize[20121]: time 0.204: calculating for device '/dev/ar0s1a' with 'ufs'
sendsize[20121]: time 0.204: running "/sbin/dump 0Shsf 0 1048576 - /dev/ar0s1a"
sendsize[20121]: time 0.205: running /usr/local/libexec/amanda/killpgrp
sendsize[20121]: time 0.344:   DUMP: Date of this level 0 dump: Wed Sep 14 
01:01:03 2005
sendsize[20121]: time 0.344:   DUMP: Date of last level 0 dump: the epoch
sendsize[20121]: time 0.345:   DUMP: Dumping /dev/ar0s1a (/) to standard output
sendsize[20121]: time 1.412:   DUMP: mapping (Pass I) [regular files]
sendsize[20121]: time 320.249:   DUMP: mapping (Pass II) [directories]
sendsize[20121]: time 320.250:   DUMP: estimated 30567356 tape blocks.
sendsize[20121]: time 320.250: .....
sendsize[20121]: estimate time for ar0s1a level 0: 320.045
sendsize[20121]: estimate size for ar0s1a level 0: 30567356 KB
sendsize[20121]: time 320.250: asking killpgrp to terminate
sendsize[20121]: time 321.252: getting size via dump for ar0s1a level 1
sendsize[20121]: time 321.252: calculating for device '/dev/ar0s1a' with 'ufs'
sendsize[20121]: time 321.252: running "/sbin/dump 1Shsf 0 1048576 - 
/dev/ar0s1a"
sendsize[20121]: time 321.253: running /usr/local/libexec/amanda/killpgrp
sendsize[20121]: time 321.258:   DUMP: Date of this level 1 dump: Wed Sep 14 
01:06:24 2005
sendsize[20121]: time 321.259:   DUMP: Date of last level 0 dump: Wed Aug 10 
14:30:11 2005
sendsize[20121]: time 321.259:   DUMP: Dumping /dev/ar0s1a (/) to standard 
output
sendsize[20121]: time 328.261:   DUMP: mapping (Pass I) [regular files]
sendsize[20121]: time 1462.879:   DUMP: mapping (Pass II) [directories]
sendsize[20121]: time 2950.657:   DUMP: estimated 17959269 tape blocks.
sendsize[20121]: time 2950.658: .....
sendsize[20121]: estimate time for ar0s1a level 1: 2629.405
sendsize[20121]: estimate size for ar0s1a level 1: 17959269 KB
sendsize[20121]: time 2950.658: asking killpgrp to terminate
sendsize[20121]: time 2951.660: done with amname 'ar0s1a', dirname '/', spindle 
-1
sendsize[20113]: time 2951.660: child 20121 terminated normally
sendsize: time 2951.660: pid 20113 finish time Wed Sep 14 01:50:14 2005

Med venlig hilsen / Best regards
Tommy Eriksen - Chief Technical Officer

 Rackhosting.com kan fungere som dit firma's eksterne IT-afdeling.

- Usikker på om backuppen fungerer når du skal bruge den?
- Træt af spam, virus og ustabil email?
- Træt af dyre telefonregninger og vanskelige telefonsystemer? 

Vidste du at du kan outsource det hele til os, i sikre facili-
teter med professionelle medarbejdere der passer på DIG døgnet rundt. Så får du 
den service og sikkerhed du har behov for til at koncentrere dig om din 
kerneforretning. Det er mere effektivt og billigere end at dedikere løn og dyr 
teknik til en intern IT afdeling.

 Og husk, vi går ikke på kompromis med serviceniveauet, du kan altid ringe uden 
at være på hold i halve timer ad gangen.

Ring nu og aftal et møde med en af vores konsulenter på telefon 70 22 33 04



<Prev in Thread] Current Thread [Next in Thread>