Amanda-Users

amdump stuck - possible timing problem

2004-05-29 13:22:35
Subject: amdump stuck - possible timing problem
From: "David Trusty" <dwtrusty AT hotmail DOT com>
To: amanda-users AT amanda DOT org
Date: Sat, 29 May 2004 12:12:27 -0500
Hi,

I have an amdump which is stuck.

Here is what I am seeing:

===output of ps===
amanda 9110 9109 0 May28 ? 00:00:00 /bin/sh /usr/local/sbin/amdump Daily amanda 9120 9110 0 May28 ? 00:00:00 /usr/local/libexec/driver Daily
amanda    9122  9121  0 May28 ?        00:00:01 taper Daily
amanda    9125  9120  0 May28 ?        00:00:00 dumper1 Daily
amanda    9126  9120  0 May28 ?        00:00:00 dumper2 Daily
amanda    9127  9120  0 May28 ?        00:00:00 dumper3 Daily
amanda    9341  9124  0 10:08 ?        00:00:00 [gzip <defunct>]
amanda    9121  9120  0 May28 ?        00:00:01 taper Daily
amanda    9124  9120  0 May28 ?        00:00:01 dumper0 Daily

====excerpt from amstatus=====
localhost:/mnt/morpheus-home/amanda 1 1390k wait for dumping localhost:/mnt/morpheus-home/dtrusty 0 6795460k wait for dumping localhost:/mnt/morpheus-home/jeff 1 460k wait for dumping localhost:/mnt/morpheus-home/root 1 520k dumping to tape (10:08:37) localhost:/mnt/morpheus-home/shuping 0 0k finished (4:46:20) localhost:/mnt/morpheus-home2/txu 1 60k finished (20:44:08) localhost:/mnt/progress-share 0 planner: [disk /mnt/progress-share, all estimate failed]

SUMMARY          part      real  estimated
                          size       size
partition       : 112
estimated       : 111            238961388k
flush           :   0         0k
failed          :   1                    0k           (  0.00%)
wait for dumping:  32            238565010k           ( 99.83%)
dumping to tape :   1                  520k           (  0.00%)
dumping         :   0         0k         0k (  0.00%) (  0.00%)
dumped          :  79     95550k    396378k ( 24.11%) (  0.04%)
wait for writing:   0         0k         0k (  0.00%) (  0.00%)
wait to flush   :   0         0k         0k (100.00%) (  0.00%)
writing to tape :   0         0k         0k (  0.00%) (  0.00%)
failed to tape  :   0         0k         0k (  0.00%) (  0.00%)
taped           :  78     95550k    395858k ( 24.14%) (  0.04%)
 tape 1        : 236     95550k   4940448k (  0.20%) DailySet1-008
3 dumpers idle  : not-idle
taper writing, tapeq: 0
network free kps:      1986
holding space   :   5242880k (100.00%)
dumper0 busy   : 13:23:30  ( 99.32%)
  taper busy   : 13:28:59  (100.00%)
0 dumpers busy :  0:00:00  (  0.00%)
1 dumper busy : 13:29:00 (100.00%) not-idle: 13:29:00 (100.00%)

===output from most recent sendbackup debug file in /tmp/amanda====
sendbackup: debug 1 pid 9342 ruid 33 euid 33: start at Sat May 29 10:08:38 2004
/usr/local/libexec/sendbackup: version 2.4.4p2
 parsed request as: program `GNUTAR'
                    disk `/mnt/morpheus-home/root'
                    device `/mnt/morpheus-home/root'
                    level 1
                    since 2004:5:25:2:1:16
                    options `|;bsd-auth;index;'
sendbackup: try_socksize: send buffer size is 65536
sendbackup: time 0.001: stream_server: waiting for connection: 0.0.0.0.56630
sendbackup: time 0.001: stream_server: waiting for connection: 0.0.0.0.56631
sendbackup: time 0.001: stream_server: waiting for connection: 0.0.0.0.56632
sendbackup: time 0.001: waiting for connect on 56630, then 56631, then 56632
sendbackup: time 30.002: stream_accept: timeout after 30 seconds
sendbackup: time 30.002: timeout on data port 56630
sendbackup: time 60.002: stream_accept: timeout after 30 seconds
sendbackup: time 60.002: timeout on mesg port 56631
sendbackup: time 90.002: stream_accept: timeout after 30 seconds
sendbackup: time 90.002: timeout on index port 56632
sendbackup: time 90.002: pid 9342 finish time Sat May 29 10:10:08 2004

===output from most recent amandad debug file in /tmp/amanda===
amandad: debug 1 pid 9340 ruid 33 euid 33: start at Sat May 29 10:08:38 2004
amandad: version 2.4.4p2
amandad: build: VERSION="Amanda-2.4.4p2"
amandad:        BUILT_DATE="Tue Apr 13 22:15:46 CDT 2004"
amandad: BUILT_MACH="Linux genomics.swmed.edu 2.4.20-20.9 #1 Mon Aug 18 11:45:58 EDT 2003 i686 i686 i386 GNU/Linux"
amandad:        CC="gcc"
amandad: CONFIGURE_COMMAND="'./configure' '--with-user=amanda' '--with-group=disk'"
amandad: paths: bindir="/usr/local/bin" sbindir="/usr/local/sbin"
amandad:        libexecdir="/usr/local/libexec" mandir="/usr/local/man"
amandad:        AMANDA_TMPDIR="/tmp/amanda" AMANDA_DBGDIR="/tmp/amanda"
amandad:        CONFIG_DIR="/usr/local/etc/amanda" DEV_PREFIX="/dev/"
amandad:        RDEV_PREFIX="/dev/" DUMP="/sbin/dump"
amandad:        RESTORE="/sbin/restore" VDUMP=UNDEF VRESTORE=UNDEF
amandad:        XFSDUMP=UNDEF XFSRESTORE=UNDEF VXDUMP=UNDEF VXRESTORE=UNDEF
amandad:        SAMBA_CLIENT="/usr/bin/smbclient" GNUTAR="/bin/gtar"
amandad:        COMPRESS_PATH="/bin/gzip" UNCOMPRESS_PATH="/bin/gzip"
amandad:        LPRCMD="/usr/bin/lpr" MAILER="/usr/bin/Mail"
amandad:        listed_incr_dir="/usr/local/var/amanda/gnutar-lists"
amandad: defs:  DEFAULT_SERVER="genomics.swmed.edu"
amandad:        DEFAULT_CONFIG="DailySet1"
amandad:        DEFAULT_TAPE_SERVER="genomics.swmed.edu"
amandad:        DEFAULT_TAPE_DEVICE="/dev/null" HAVE_MMAP HAVE_SYSVSHM
amandad:        LOCKING=POSIX_FCNTL SETPGRP_VOID DEBUG_CODE
amandad:        AMANDA_DEBUG_DAYS=4 BSD_SECURITY USE_AMANDAHOSTS
amandad:        CLIENT_LOGIN="amanda" FORCE_USERID HAVE_GZIP
amandad:        COMPRESS_SUFFIX=".gz" COMPRESS_FAST_OPT="--fast"
amandad:        COMPRESS_BEST_OPT="--best" UNCOMPRESS_OPT="-dc"
amandad: time 0.000: got packet:
--------
Amanda 2.4 REQ HANDLE 000-70E30608 SEQ 1085792638
SECURITY USER amanda
SERVICE sendbackup
OPTIONS features=fffffeff9ffe0f;hostname=localhost;
GNUTAR /mnt/morpheus-home/root  1 2004:5:25:2:1:16 OPTIONS |;bsd-auth;index;
--------

amandad: time 0.000: sending ack:
----
Amanda 2.4 ACK HANDLE 000-70E30608 SEQ 1085792638
----

amandad: time 0.001: bsd security: remote host genomics.swmed.edu user amanda local user amanda
amandad: time 0.001: amandahosts security check passed
amandad: time 0.001: running service "/usr/local/libexec/sendbackup"
amandad: time 0.009: got packet:
----
Amanda 2.4 ACK HANDLE 000-70E30608 SEQ 1085792638
----

amandad: time 0.009: it is not a P_REQ, ignoring it
amandad: time 0.010: sending REP packet:
----
Amanda 2.4 REP HANDLE 000-70E30608 SEQ 1085792638
CONNECT DATA 56630 MESG 56631 INDEX 56632
OPTIONS features=fffffeff9ffe0f;
----

amandad: time 10.010: dgram_recv: timeout after 10 seconds
amandad: time 10.010: waiting for ack: timeout, retrying
amandad: time 20.010: dgram_recv: timeout after 10 seconds
amandad: time 20.010: waiting for ack: timeout, retrying
amandad: time 30.010: dgram_recv: timeout after 10 seconds
amandad: time 30.010: waiting for ack: timeout, retrying
amandad: time 40.010: dgram_recv: timeout after 10 seconds
amandad: time 40.010: waiting for ack: timeout, retrying
amandad: time 50.010: dgram_recv: timeout after 10 seconds
amandad: time 50.010: waiting for ack: timeout, giving up!
amandad: time 50.010: pid 9340 finish time Sat May 29 10:09:28 2004


Any ideas?  Need any more info?

Thanks,

David

_________________________________________________________________
Stop worrying about overloading your inbox - get MSN Hotmail Extra Storage! http://join.msn.click-url.com/go/onm00200362ave/direct/01/


<Prev in Thread] Current Thread [Next in Thread>
  • amdump stuck - possible timing problem, David Trusty <=