Amanda-Users

Strange timeout errors..

2003-11-20 03:10:58
Subject: Strange timeout errors..
From: Hillel <hillel AT matat.health.gov DOT il>
To: amanda-users AT amanda DOT org
Date: Thu, 20 Nov 2003 10:08:08 +0200 (IST)
Hi all,

I've been backing up about 12 linux and solaris boxes with amanda 2.4.2
for a years now. Finally i got new ibm eservers to replace the old pcs 2
some of my linux servers were on and the move was wonderful, that is,
except for amanda.
2 of the servers moved are backup up fine with no changes required,
however 2 others are giving me some grief.

amcheck runs fine on all of them. (no errors detected)

when amdump runs on the amanda server the process seems to start on the
clients, and then time out. The logs are:

The server shows:

FAILURE AND STRANGE DUMP SUMMARY:
  mail-ex-1. /var/spool/exim lev 0 FAILED [Request to mail-ex-1.health.gov.il 
timed out.]
  mail-ex-1. /boot lev 0 FAILED [Request to mail-ex-1.health.gov.il timed out.]
  mail-ex-1. / lev 0 FAILED [Request to mail-ex-1.health.gov.il timed out.]
  mail-ex-2. /var/spool/exim lev 0 FAILED [Request to mail-ex-2.health.gov.il 
timed out.]
  mail-ex-2. /boot lev 0 FAILED [Request to mail-ex-2.health.gov.il timed out.]
  mail-ex-2. / lev 0 FAILED [Request to mail-ex-2.health.gov.il timed out.]

and the clients show:

amandad: debug 1 pid 11967 ruid 502 euid 502 start time Wed Nov 19
17:41:02 2003
amandad: version 2.4.2p2
amandad: build: VERSION="Amanda-2.4.2p2"
amandad:        BUILT_DATE="Sun Aug 24 10:53:13 IDT 2003"
amandad:        BUILT_MACH="Linux mail-ex-1 2.4.20-8smp #1 SMP Thu Mar 13
17:45:54 EST 2003 i686 i686 i3
86 GNU/Linux"
amandad:        CC="gcc"
amandad: paths: bindir="/usr/local/bin" sbindir="/usr/local/sbin"
amandad:        libexecdir="/usr/local/libexec" mandir="/usr/local/man"
amandad:        AMANDA_TMPDIR="/tmp/amanda" AMANDA_DBGDIR="/tmp/amanda"
amandad:        CONFIG_DIR="/usr/local/etc/amanda" DEV_PREFIX="/dev/"
amandad:        RDEV_PREFIX="/dev/" DUMP="/sbin/dump"
amandad:        RESTORE="/sbin/restore" SAMBA_CLIENT="/usr/bin/smbclient"
amandad:        GNUTAR="/bin/gtar" COMPRESS_PATH="/bin/gzip"
amandad:        UNCOMPRESS_PATH="/bin/gzip" MAILER="/usr/bin/Mail"
amandad:        listed_incr_dir="/usr/local/var/amanda/gnutar-lists"
amandad: defs:  DEFAULT_SERVER="mail-ex-1" DEFAULT_CONFIG="DailySet1"
amandad:        DEFAULT_TAPE_SERVER="mail-ex-1"
amandad:        DEFAULT_TAPE_DEVICE="/dev/null" HAVE_MMAP HAVE_SYSVSHM
amandad:        LOCKING=POSIX_FCNTL SETPGRP_VOID DEBUG_CODE
amandad:        AMANDA_DEBUG_DAYS=4 BSD_SECURITY USE_AMANDAHOSTS
amandad:        CLIENT_LOGIN="amanda" FORCE_USERID HAVE_GZIP
amandad:        COMPRESS_SUFFIX=".gz" COMPRESS_FAST_OPT="--fast"
amandad:        COMPRESS_BEST_OPT="--best" UNCOMPRESS_OPT="-dc"
got packet:
--------
Amanda 2.4 REQ HANDLE 002-A0C00708 SEQ 1069256707
SECURITY USER amanda
SERVICE sendsize
OPTIONS maxdumps=1;hostname=mail-ex-1.health.gov.il;
GNUTAR /var/spool/exim 0 1970:1:1:0:0:0 -1
GNUTAR /var/spool/exim 1 2003:11:6:14:55:6 -1
GNUTAR /boot 0 1970:1:1:0:0:0 -1
GNUTAR /boot 1 2003:11:12:18:58:10 -1
GNUTAR / 0 1970:1:1:0:0:0 -1
GNUTAR / 1 2003:11:19:3:1:0 -1
--------

sending ack:
----
Amanda 2.4 ACK HANDLE 002-A0C00708 SEQ 1069256707
----

bsd security: remote host amandasrv.matat.health.gov.il user amanda local user 
amanda
amandahosts security check passed
amandad: running service "/usr/local/libexec/sendsize"
amandad: sending REP packet:
----
Amanda 2.4 REP HANDLE 002-A0C00708 SEQ 1069256707
OPTIONS maxdumps=1;
/ 0 SIZE 1346680
/ 1 SIZE 18660
/boot 0 SIZE 10870
/boot 1 SIZE 10
/var/spool/exim 0 SIZE 15810
/var/spool/exim 1 SIZE 13280
----

amandad: dgram_recv: timeout after 10 seconds
amandad: waiting for ack: timeout, retrying
amandad: dgram_recv: timeout after 10 seconds
amandad: waiting for ack: timeout, retrying
amandad: dgram_recv: timeout after 10 seconds
amandad: waiting for ack: timeout, retrying
amandad: dgram_recv: timeout after 10 seconds
amandad: waiting for ack: timeout, retrying
amandad: dgram_recv: timeout after 10 seconds
amandad: waiting for ack: timeout, giving up!
amandad: pid 11967 finish time Wed Nov 19 17:43:07 2003


I would greatly appreciate anybodys help on this matter..

Thanks ahead,

Hillel Greenberg
EMS/Unix Administrator
Ministry of Health, Israel
hillel.g AT moh.health.gov DOT il

<Prev in Thread] Current Thread [Next in Thread>
  • Strange timeout errors.., Hillel <=