Amanda-Users

RE: signal 13 (PIPE) error.

2005-03-14 20:09:50
Subject: RE: signal 13 (PIPE) error.
From: donald.ritchey AT exeloncorp DOT com
To: norstead AT ns.sympatico DOT ca, amanda-users AT amanda DOT org
Date: Mon, 14 Mar 2005 18:55:37 -0600
Bruce:

I don't want you to feel like we are all jumping on you, but...

First:

I noticed that you have used localhost (127.0.0.1) as the address 
for your network connections, and as the archives will illustrate 
in incredible detail, we generally consider that a Bad Idea (Tm).

Second:

Removing the holding disk from the picture actually makes the picture
more murky, not less.  If you have a holding disk, then the backup
of the data happens in two parts:  the dumper process gets the data
off of the disk being backed up and onto the holding disk, and then
the taper process moves the backed up data from the holding disk to
the tape drive.  Combining the two obscures where the actual problem
is:  in the dumper or in the taper.
My natural instinct is to look at the tape drive and the taper
portion of the process, since that is a more mechanically driven 
process and past history (mine) indicates that most of the errors
that I have encountered have been there.
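
As for what "signal 13" means: it is SIGPIPE, which a process receives
when it writes to a pipe or socket whose reader has gone away.  My
reading of your logs (an inference, not something they state outright)
is that the taper stopped reading after the tape I/O error, the dumper
then saw "Connection reset by peer", and tar was left writing into a
pipe nobody was reading.  The small Python sketch below is not Amanda
code; it only reproduces the mechanism with a throwaway writer process
standing in for tar.  The names WRITER and run_demo are made up for the
illustration, and it assumes a POSIX system.

```python
import subprocess
import sys

# Stand-in for tar: a child that writes to stdout forever.
# Python ignores SIGPIPE by default, so the child restores the
# default action to behave like an ordinary C program such as tar.
WRITER = (
    "import os, signal\n"
    "signal.signal(signal.SIGPIPE, signal.SIG_DFL)\n"
    "chunk = b'x' * 65536\n"
    "while True:\n"
    "    os.write(1, chunk)\n"
)


def run_demo():
    """Start the writer, read a little, then close the read end.

    Returns the writer's exit status as reported by subprocess:
    a negative number means it was killed by that signal number.
    """
    writer = subprocess.Popen(
        [sys.executable, "-c", WRITER], stdout=subprocess.PIPE
    )
    writer.stdout.read(1024)  # the reader accepts some data...
    writer.stdout.close()     # ...then goes away, like a failed taper
    return writer.wait()


if __name__ == "__main__":
    print("writer exit status:", run_demo())
```

On Linux, where SIGPIPE is signal number 13, I would expect the status
to come back as -13.  The point is that the process reporting signal 13
is usually the victim, not the cause; the cause is whatever stopped
reading downstream.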

Repeat the experiment with these two issues corrected, and I think the
picture will get a little clearer.
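
When you do rerun it, the thing to check in the Amanda log is which
process complains first.  Here is a rough sketch of that check in
Python.  It assumes the "TYPE process message" line shape seen in the
log excerpt you posted, and the function names (trouble_lines,
first_to_complain) are my own invention, not part of Amanda.  The
sample text is copied from your log.

```python
import re

# Assumed entry shape, based on the log excerpt in this thread:
#   TYPE PROCESS rest-of-message
ENTRY = re.compile(r"^(INFO|WARNING|FAIL|ERROR)\s+(\w+)\s+(.*)$")

SAMPLE = """\
START taper datestamp 20050311 label Norstead01 tape 0
INFO taper tape Norstead01 kb 192 fm 1 writing file: Input/output error
FAIL dumper alsike alsike_boot 20050311 0 ["data write: Connection reset by peer"]
FAIL taper alsike alsike_boot 20050311 0 [out of tape]
"""


def trouble_lines(log_text):
    """Return (process, type, message) for entries reporting trouble, in file order."""
    hits = []
    for raw in log_text.splitlines():
        # Drop mail quoting ("> ") and trailing whitespace before matching.
        line = raw.lstrip("> ").rstrip()
        m = ENTRY.match(line)
        if not m:
            continue
        kind, process, message = m.groups()
        # FAIL and ERROR entries count, as does any entry mentioning an error.
        if kind in ("FAIL", "ERROR") or "error" in message.lower():
            hits.append((process, kind, message))
    return hits


def first_to_complain(log_text):
    """Name of the process behind the earliest trouble entry, or None."""
    hits = trouble_lines(log_text)
    return hits[0][0] if hits else None


if __name__ == "__main__":
    print("first to complain:", first_to_complain(SAMPLE))
```

Run against the excerpt above, the earliest trouble entry belongs to
the taper, which is consistent with my instinct to look at the tape
side first.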

Best wishes,

Don
Donald L. (Don) Ritchey
Information Technology
Exelon Corporation

-----Original Message-----
From: norstead AT ns.sympatico DOT ca [mailto:norstead AT ns.sympatico DOT ca]
Sent: Monday, March 14, 2005 4:37 PM
To: amanda-users AT amanda DOT org
Subject: signal 13 (PIPE) error.



My last dozen or so emails to amanda-users have gone into a black
hole, so I'll try this from another network under another subject.


So let's try and walk (crawl?) before we run.

I've set things up with only one small disk to be backed up, set
dumpcycle 0 and removed the holding disk and tape changer from the
config.  It's still failing.  There is a signal 13 (PIPE) error in
sendbackup.debug.

The amanda log shows:
  INFO taper tape Norstead01 kb 192 fm 1 writing file: Input/output error
  FAIL dumper alsike alsike_boot 20050311 0 ["data write: Connection reset by peer"]

The amdump log shows:
 taper : writing end marker. [Norstead01 ERR kb 192 fm 1]
 dumper : kill compress command
 driver : result time 18.083 from dumper0: FAILED 01-00002 ["data write: Connection reset by peer"]
 driver : result time 18.084 from taper: TAPE-ERROR 00-00001 [writing file: Input/output error]

The sendbackup.20050311152708.debug shows:
 sendbackup : time 9.062: error [/bin/tar got signal 13]

Does anybody know what this means?

regards :-)
BruceS


 - - - - - - - - - /var/log/amanda/Norstead/log.20050311.0 - - - - - - - - -

> DISK planner alsike alsike_boot
> START planner date 20050311
> WARNING planner Last full dump of alsike:alsike_boot on tape  overwritten in 1 run.
> START driver date 20050311
> FINISH planner date 20050311
> STATS driver startup time 0.029
> START taper datestamp 20050311 label Norstead01 tape 0

   v
> INFO taper tape Norstead01 kb 192 fm 1 writing file: Input/output error
> FAIL dumper alsike alsike_boot 20050311 0 ["data write: Connection reset by peer"]
   ^

>   sendbackup: start [alsike:alsike_boot level 0]
>   sendbackup: info BACKUP=/bin/tar
>   sendbackup: info RECOVER_CMD=/bin/tar -f... -
>   sendbackup: info end
> FAIL taper alsike alsike_boot 20050311 0 [out of tape]
> ERROR taper no-tape [[writing file: Input/output error]]
> FAIL driver alsike alsike_boot 20050311 0 [dump to tape failed]
> FINISH driver date 20050311 time 18.084





 - - - - - - - - - - /var/log/amanda/Norstead/amdump.1 - - - - - - - - - - 

> amdump: start at Fri Mar 11 15:26:59 GMT 2005
> amdump: datestamp 20050311
> planner: pid 16003 executable /usr/lib/amanda/planner version 2.4.4p2
> planner: build: VERSION="Amanda-2.4.4p2"
> planner:        BUILT_DATE="Fri Aug 13 18:32:08 UTC 2004"
> planner:        BUILT_MACH="Linux mcmurdo 2.4.27-es #1 SMP Sun Aug 8 13:02:24 BST 2004 i686 GNU/Linux"
> planner:        CC="gcc"
> planner:        CONFIGURE_COMMAND="'./configure' '--prefix=/usr' '--bindir=/usr/sbin' '--mandir=/usr/share/man' '--libexecdir=/usr/lib/amanda' '--enable-shared' '--sysconfdir=/etc' '--localstatedir=/var/lib' '--with-gnutar-listdir=/var/lib/amanda/gnutar-lists' '--with-index-server=localhost' '--with-user=backup' '--with-group=backup' '--with-bsd-security' '--with-amandahosts' '--with-smbclient=/usr/bin/smbclient' '--with-tcpportrange=50000,50100' '--with-udpportrange=840,860'"
> planner: paths: bindir="/usr/sbin" sbindir="/usr/sbin"
> planner:        libexecdir="/usr/lib/amanda" mandir="/usr/share/man"
> planner:        AMANDA_TMPDIR="/tmp/amanda" AMANDA_DBGDIR="/tmp/amanda"
> planner:        CONFIG_DIR="/etc/amanda" DEV_PREFIX="/dev/"
> planner:        RDEV_PREFIX="/dev/r" DUMP="/sbin/dump"
> planner:        RESTORE="/sbin/restore" VDUMP=UNDEF VRESTORE=UNDEF
> planner:        XFSDUMP="/sbin/xfsdump" XFSRESTORE="/sbin/xfsrestore"
> planner:        VXDUMP=UNDEF VXRESTORE=UNDEF
> planner:        SAMBA_CLIENT="/usr/bin/smbclient" GNUTAR="/bin/tar"
> planner:        COMPRESS_PATH="/bin/gzip" UNCOMPRESS_PATH="/bin/gzip"
> planner:        LPRCMD="/usr/bin/lpr" MAILER="/usr/bin/Mail"
> planner:        listed_incr_dir="/var/lib/amanda/gnutar-lists"
> planner: defs:  DEFAULT_SERVER="localhost" DEFAULT_CONFIG="DailySet1"
> planner:        DEFAULT_TAPE_SERVER="localhost"
> planner:        DEFAULT_TAPE_DEVICE="/dev/null" HAVE_MMAP HAVE_SYSVSHM
> planner:        LOCKING=POSIX_FCNTL SETPGRP_VOID DEBUG_CODE
> planner:        AMANDA_DEBUG_DAYS=4 BSD_SECURITY USE_AMANDAHOSTS
> planner:        CLIENT_LOGIN="backup" FORCE_USERID HAVE_GZIP
> planner:        COMPRESS_SUFFIX=".gz" COMPRESS_FAST_OPT="--fast"
> planner:        COMPRESS_BEST_OPT="--best" UNCOMPRESS_OPT="-dc"
> planner: time 0.000: dgram_bind: socket bound to 0.0.0.0.858
> READING CONF FILES...
> planner: time 0.000: startup took 0.000 secs
> 
> SETTING UP FOR ESTIMATES...
> planner: time 0.000: setting up estimates for alsike:alsike_boot
> alsike:alsike_boot overdue 12854 days for level 0
> setup_estimate: alsike:alsike_boot: command 0, options:
>     last_level 0 next_level0 -12854 level_days 0
>     getting estimates 0 (-1) 1 (0) -1 (-1)
> planner: time 0.000: setting up estimates took 0.000 secs
> 
> GETTING ESTIMATES...
> driver: pid 16004 executable /usr/lib/amanda/driver version 2.4.4p2
> driver: tape size 40960000
> driver: send-cmd time 0.001 to taper: START-TAPER 20050311
> driver: started dumper0 pid 16006
> dumper: dgram_bind: socket bound to 0.0.0.0.846
> dumper: pid 16012 executable dumper1 version 2.4.4p2, using port 846
> driver: started dumper1 pid 16012
> planner: time 0.032: got result for host alsike disk alsike_boot: 0 -> 31700K, 1 -> 10K, -1 -> -1K
> planner: time 0.032: getting estimates took 0.031 secs
> FAILED QUEUE: empty
> DONE QUEUE:
>   0: alsike     alsike_boot
> 
> ANALYZING ESTIMATES...
> pondering alsike:alsike_boot... next_level0 -12854 last_level 0 (due for level 0) (picking inclevel for degraded mode)   picklev: last night 0, so tonight level 1
> 
>   curr level 0 size 15850 total size 15970 total_lev0 15850 balanced-lev0size 2264
> INITIAL SCHEDULE (size 15970):
>   alsike alsike_boot pri 12856 lev 0 size 15850
> 
> DELAYING DUMPS IF NEEDED, total_size 15970, tape length 40960000 mark 8
>   delay: Total size now 15970.
> 
> PROMOTING DUMPS IF NEEDED, total_lev0 15850, balanced_size 2264...
> planner: time 0.032: analysis took 0.000 secs
> 
> GENERATING SCHEDULE:
> --------
> ENDFLUSH
> DUMP alsike fffffeff9ffe0f alsike_boot 20050311 12856 0 1970:1:1:0:0:0 15850 528 1 1970:1:1:0:0:0 5 0
> --------
> reserving 0 out of 0 for degraded-mode dumps
> driver: flush size 0
> driver: start time 0.029 inparallel 2 bandwidth 600 diskspace 0 dir OBSOLETE datestamp 20050311 driver: drain-ends tapeq LARGESTFIT big-dumpers ttt
> dumper: dgram_bind: socket bound to 0.0.0.0.840
> dumper: pid 16006 executable dumper0 version 2.4.4p2, using port 840
> taper: read label `Norstead01' date `X'
> taper: wrote label `Norstead01' date `20050311'
> driver: result time 9.015 from taper: TAPER-OK
> driver: dumping alsike:alsike_boot directly to tape
> driver: send-cmd time 9.016 to taper: PORT-WRITE 00-00001 alsike fffffeff9ffe0f alsike_boot 0 20050311
> taper: try_socksize: receive buffer size is 65536
> taper: stream_server: waiting for connection: 0.0.0.0.50083
> driver: result time 9.016 from taper: PORT 50083
> driver: send-cmd time 9.016 to dumper0: PORT-DUMP 01-00002 50083 alsike fffffeff9ffe0f alsike_boot /boot 0 1970:1:1:0:0:0 GNUTAR |;bsd-auth;srvcomp-fast;
> dumper: stream_client: connected to 127.0.0.1.50083
> dumper: stream_client: our side is 0.0.0.0.50084
> dumper: try_socksize: send buffer size is 65536
> taper: stream_accept: connection from 127.0.0.1.50084
> taper: try_socksize: receive buffer size is 32768
> driver: state time 9.018 free kps: 570 space: 0 taper: writing idle-dumpers: 1 qlen tapeq: 0 runq: 0 roomq: 0 wakeup: 86400 driver-idle: not-idle
> driver: interface-state time 9.018 if : free 570
> driver: hdisk-state time 9.018
> dumper: stream_client: connected to 127.0.0.1.50097
> dumper: stream_client: our side is 0.0.0.0.50085
> dumper: stream_client: connected to 127.0.0.1.50098
> dumper: stream_client: our side is 0.0.0.0.50086

   v
> taper: writing end marker. [Norstead01 ERR kb 192 fm 1]
> dumper: kill compress command
> driver: result time 18.083 from dumper0: FAILED 01-00002 ["data write: Connection reset by peer"]
> driver: result time 18.084 from taper: TAPE-ERROR 00-00001 [writing file: Input/output error]
   ^

> dump of driver schedule before start degraded mode:
> --------
> --------
> dump of driver schedule after start degraded mode:
> --------
> --------
> driver: state time 18.084 free kps: 600 space: 0 taper: DOWN idle-dumpers: 2 qlen tapeq: 0 runq: 0 roomq: 0 wakeup: 86400 driver-idle: not-idle
> driver: interface-state time 18.084 if : free 600
> driver: hdisk-state time 18.084
> driver: QUITTING time 18.084 telling children to quit
> driver: send-cmd time 18.084 to dumper0: QUIT
> driver: send-cmd time 18.084 to dumper1: QUIT
> driver: send-cmd time 18.084 to taper: QUIT
> taper: DONE [idle wait: 0.000 secs]
> driver: FINISHED time 18.084
> amdump: end at Fri Mar 11 15:27:17 GMT 2005
 

 - - - - - - - - - /tmp/amanda/sendbackup.20050311152708.debug - - - - - - - - -

> sendbackup: debug 1 pid 16019 ruid 34 euid 34: start at Fri Mar 11 15:27:08 2005
> /usr/lib/amanda/sendbackup: version 2.4.4p2
>   parsed request as: program `GNUTAR'
>                      disk `alsike_boot'
>                      device `/boot'
>                      level 0
>                      since 1970:1:1:0:0:0
>                      options `|;bsd-auth;srvcomp-fast;'
> sendbackup: try_socksize: send buffer size is 65536
> sendbackup: time 0.000: stream_server: waiting for connection: 0.0.0.0.50097
> sendbackup: time 0.000: stream_server: waiting for connection: 0.0.0.0.50098
> sendbackup: time 0.001: waiting for connect on 50097, then 50098
> sendbackup: time 0.001: stream_accept: connection from 127.0.0.1.50085
> sendbackup: time 0.003: stream_accept: connection from 127.0.0.1.50086
> sendbackup: time 0.003: got all connections
> sendbackup-gnutar: time 0.003: doing level 0 dump as listed-incremental to /var/lib/amanda/gnutar-lists/alsikealsike_boot_0.new
> sendbackup-gnutar: time 0.006: doing level 0 dump from date: 1970-01-01 0:00:00 GMT
> sendbackup: time 0.007: spawning /usr/lib/amanda/runtar in pipeline
> sendbackup: argument list: gtar --create --file - --directory /boot --one-file-system --listed-incremental /var/lib/amanda/gnutar-lists/alsikealsike_boot_0.new --sparse --ignore-failed-read --totals .
> sendbackup-gnutar: time 0.122: /usr/lib/amanda/runtar: pid 16022

   v
> sendbackup: time 9.062: error [/bin/tar got signal 13]
   ^

> sendbackup: time 9.062: pid 16019 finish time Fri Mar 11 15:27:17 2005


-- 

Norstead Farm - Bruce & Carole Skinner
RR#1 Waterville NS Canada B0P 1V0
Tel: 902-538-1765
Fax: 902-538-1794
<mailto:norstead AT ns.sympatico DOT ca>



