Amanda-Users

Re: signal 13 (PIPE) error.

2005-03-14 21:47:24
Subject: Re: signal 13 (PIPE) error.
From: Bruce.Skinner AT drdc-rddc.gc DOT ca (Bruce S. Skinner)
To: amanda-users AT amanda DOT org
Date: 15 Mar 2005 02:16:58 +0000
Hello Don,

donald.ritchey AT exeloncorp DOT com writes:

> Bruce:
> 
> I don't want you to feel like we are all jumping on you, but...

I don't (and I have a thick skin).  You may have misconstrued my
comments about the "black hole" as inferring a lack of answers from
the list.  On the contrary, I've found people here are quite helpful
and I am very appreciative.  I was referring to the fact that I have
actually sent close to a dozen copies of this e-mail to the list, but
it never showed up on the list and I received no bounced email either.
I'm guessing that somewhere along the way, something decided my email
was spam and ate it ;-)

> First:
> 
> I noticed that you have used localhost (127.0.0.1) as the address 
> for your network connections, and as the archives will illustrate 
> in incredible detail, we generally consider that a Bad Idea (Tm).

I had missed that.  The only archive of the list that I was aware of
until today was yahoo and I find that excruciating to search with.
However I saw a post today that pointed me to gmane, searching there
is better I believe.

> Second:
> 
> Removing the holding disk from the picture actually makes the picture
> more murky, not less.  If you have a holding disk, then the backup 
> of the data happens in two parts, separating the dumper process of
> getting the data off of the disk to be backed up and onto the holding
> disk, and then the taper process moves the backed up data from
> the holding disk to the tape drive.  Combining the two obscures 
> where the actual problem is:  in the dumper or in the taper.
> My natural instinct is to look at the tape drive and the taper
> portion of the process, since that is a more mechanically driven 
> process and past history (mine) indicates that most of the errors
> that I have encountered have been there.

I did have the holding disk configured and got the same error, I also
got the error during subsequent amflush operations.

> Repeat the experiment, correcting these two issues and I think the
> picture will get a little clearer.

I will, thanks for the pointers.

regards :-)
BruceS

> Best wishes,
> 
> Don
> Donald L. (Don) Ritchey
> Information Technology
> Exelon Corporation
> 
> -----Original Message-----
> From: norstead AT ns.sympatico DOT ca [mailto:norstead AT ns.sympatico DOT ca]
> Sent: Monday, March 14, 2005 4:37 PM
> To: amanda-users AT amanda DOT org
> Subject: signal 13 (PIPE) error.
> 
> 
> 
> My last dozen or so emails to amanda-users have gone into a black
> hole, I'll try this from another network under another subject.
> 
> 
> So let's try and walk (crawl?) before we run.
> 
> I've set things up with only one small disk to be backed up, set
> dumpcycle 0 and removed the holding disk and tape changer from the
> config.  It's still failing.  There is a signal 13 (PIPE) error in
> sendbackup.debug.
> 
> The amanda log shows:
>   INFO taper tape Norstead01 kb 192 fm 1 writing file: Input/output error
>   FAIL dumper alsike alsike_boot 20050311 0 ["data write: Connection reset
> by peer"]
> 
> The amdump log shows:
>  taper : writing end marker. [Norstead01 ERR kb 192 fm 1]
>  dumper : kill compress command
>  driver : result time 18.083 from dumper0: FAILED 01-00002 ["data write:
> Connection reset by peer"]
>  driver : result time 18.084 from taper: TAPE-ERROR 00-00001 [writing file:
> Input/output error]
> 
> The sendbackup.20050311152708.debug shows:
>  sendbackup : time 9.062: error [/bin/tar got signal 13]
> 
> Does anybody know what this means?
> 
> regards :-)
> BruceS
> 
> 
>  - - - - - - - - - /var/log/amanda/Norstead/log.20050311.0 - - - - - - - - -
> 
> > DISK planner alsike alsike_boot
> > START planner date 20050311
> > WARNING planner Last full dump of alsike:alsike_boot on tape  overwritten
> in 1 run.
> > START driver date 20050311
> > FINISH planner date 20050311
> > STATS driver startup time 0.029
> > START taper datestamp 20050311 label Norstead01 tape 0
> 
>    v
> > INFO taper tape Norstead01 kb 192 fm 1 writing file: Input/output error
> > FAIL dumper alsike alsike_boot 20050311 0 ["data write: Connection reset
> by peer"]
>    ^
> 
> >   sendbackup: start [alsike:alsike_boot level 0]
> >   sendbackup: info BACKUP=/bin/tar
> >   sendbackup: info RECOVER_CMD=/bin/tar -f... -
> >   sendbackup: info end
> > FAIL taper alsike alsike_boot 20050311 0 [out of tape]
> > ERROR taper no-tape [[writing file: Input/output error]]
> > FAIL driver alsike alsike_boot 20050311 0 [dump to tape failed]
> > FINISH driver date 20050311 time 18.084
> 
> 
> 
> 
> 
>  - - - - - - - - - - /var/log/amanda/Norstead/amdump.1 - - - - - - - - - - 
> 
> > amdump: start at Fri Mar 11 15:26:59 GMT 2005
> > amdump: datestamp 20050311
> > planner: pid 16003 executable /usr/lib/amanda/planner version 2.4.4p2
> > planner: build: VERSION="Amanda-2.4.4p2"
> > planner:        BUILT_DATE="Fri Aug 13 18:32:08 UTC 2004"
> > planner:        BUILT_MACH="Linux mcmurdo 2.4.27-es #1 SMP Sun Aug 8
> 13:02:24 BST 2004 i686 GNU/Linux"
> > planner:        CC="gcc"
> > planner:        CONFIGURE_COMMAND="'./configure' '--prefix=/usr'
> '--bindir=/usr/sbin' '--mandir=/usr/share/man'
> '--libexecdir=/usr/lib/amanda' '--enable-shared' '--sysconfdir=/etc'
> '--localstatedir=/var/lib'
> '--with-gnutar-listdir=/var/lib/amanda/gnutar-lists'
> '--with-index-server=localhost' '--with-user=backup' '--with-group=backup'
> '--with-bsd-security' '--with-amandahosts'
> '--with-smbclient=/usr/bin/smbclient' '--with-tcpportrange=50000,50100'
> '--with-udpportrange=840,860'"
> > planner: paths: bindir="/usr/sbin" sbindir="/usr/sbin"
> > planner:        libexecdir="/usr/lib/amanda" mandir="/usr/share/man"
> > planner:        AMANDA_TMPDIR="/tmp/amanda" AMANDA_DBGDIR="/tmp/amanda"
> > planner:        CONFIG_DIR="/etc/amanda" DEV_PREFIX="/dev/"
> > planner:        RDEV_PREFIX="/dev/r" DUMP="/sbin/dump"
> > planner:        RESTORE="/sbin/restore" VDUMP=UNDEF VRESTORE=UNDEF
> > planner:        XFSDUMP="/sbin/xfsdump" XFSRESTORE="/sbin/xfsrestore"
> > planner:        VXDUMP=UNDEF VXRESTORE=UNDEF
> > planner:        SAMBA_CLIENT="/usr/bin/smbclient" GNUTAR="/bin/tar"
> > planner:        COMPRESS_PATH="/bin/gzip" UNCOMPRESS_PATH="/bin/gzip"
> > planner:        LPRCMD="/usr/bin/lpr" MAILER="/usr/bin/Mail"
> > planner:        listed_incr_dir="/var/lib/amanda/gnutar-lists"
> > planner: defs:  DEFAULT_SERVER="localhost" DEFAULT_CONFIG="DailySet1"
> > planner:        DEFAULT_TAPE_SERVER="localhost"
> > planner:        DEFAULT_TAPE_DEVICE="/dev/null" HAVE_MMAP HAVE_SYSVSHM
> > planner:        LOCKING=POSIX_FCNTL SETPGRP_VOID DEBUG_CODE
> > planner:        AMANDA_DEBUG_DAYS=4 BSD_SECURITY USE_AMANDAHOSTS
> > planner:        CLIENT_LOGIN="backup" FORCE_USERID HAVE_GZIP
> > planner:        COMPRESS_SUFFIX=".gz" COMPRESS_FAST_OPT="--fast"
> > planner:        COMPRESS_BEST_OPT="--best" UNCOMPRESS_OPT="-dc"
> > planner: time 0.000: dgram_bind: socket bound to 0.0.0.0.858
> > READING CONF FILES...
> > planner: time 0.000: startup took 0.000 secs
> > 
> > SETTING UP FOR ESTIMATES...
> > planner: time 0.000: setting up estimates for alsike:alsike_boot
> > alsike:alsike_boot overdue 12854 days for level 0
> > setup_estimate: alsike:alsike_boot: command 0, options:
> >     last_level 0 next_level0 -12854 level_days 0
> >     getting estimates 0 (-1) 1 (0) -1 (-1)
> > planner: time 0.000: setting up estimates took 0.000 secs
> > 
> > GETTING ESTIMATES...
> > driver: pid 16004 executable /usr/lib/amanda/driver version 2.4.4p2
> > driver: tape size 40960000
> > driver: send-cmd time 0.001 to taper: START-TAPER 20050311
> > driver: started dumper0 pid 16006
> > dumper: dgram_bind: socket bound to 0.0.0.0.846
> > dumper: pid 16012 executable dumper1 version 2.4.4p2, using port 846
> > driver: started dumper1 pid 16012
> > planner: time 0.032: got result for host alsike disk alsike_boot: 0 ->
> 31700K, 1 -> 10K, -1 -> -1K
> > planner: time 0.032: getting estimates took 0.031 secs
> > FAILED QUEUE: empty
> > DONE QUEUE:
> >   0: alsike     alsike_boot
> > 
> > ANALYZING ESTIMATES...
> > pondering alsike:alsike_boot... next_level0 -12854 last_level 0 (due for
> level 0) (picking inclevel for degraded mode)   picklev: last night 0, so
> tonight level 1
> > 
> >   curr level 0 size 15850 total size 15970 total_lev0 15850
> balanced-lev0size 2264
> > INITIAL SCHEDULE (size 15970):
> >   alsike alsike_boot pri 12856 lev 0 size 15850
> > 
> > DELAYING DUMPS IF NEEDED, total_size 15970, tape length 40960000 mark 8
> >   delay: Total size now 15970.
> > 
> > PROMOTING DUMPS IF NEEDED, total_lev0 15850, balanced_size 2264...
> > planner: time 0.032: analysis took 0.000 secs
> > 
> > GENERATING SCHEDULE:
> > --------
> > ENDFLUSH
> > DUMP alsike fffffeff9ffe0f alsike_boot 20050311 12856 0 1970:1:1:0:0:0
> 15850 528 1 1970:1:1:0:0:0 5 0
> > --------
> > reserving 0 out of 0 for degraded-mode dumps
> > driver: flush size 0
> > driver: start time 0.029 inparallel 2 bandwidth 600 diskspace 0 dir
> OBSOLETE datestamp 20050311 driver: drain-ends tapeq LARGESTFIT big-dumpers
> ttt
> > dumper: dgram_bind: socket bound to 0.0.0.0.840
> > dumper: pid 16006 executable dumper0 version 2.4.4p2, using port 840
> > taper: read label `Norstead01' date `X'
> > taper: wrote label `Norstead01' date `20050311'
> > driver: result time 9.015 from taper: TAPER-OK
> > driver: dumping alsike:alsike_boot directly to tape
> > driver: send-cmd time 9.016 to taper: PORT-WRITE 00-00001 alsike
> fffffeff9ffe0f alsike_boot 0 20050311
> > taper: try_socksize: receive buffer size is 65536
> > taper: stream_server: waiting for connection: 0.0.0.0.50083
> > driver: result time 9.016 from taper: PORT 50083
> > driver: send-cmd time 9.016 to dumper0: PORT-DUMP 01-00002 50083 alsike
> fffffeff9ffe0f alsike_boot /boot 0 1970:1:1:0:0:0 GNUTAR
> |;bsd-auth;srvcomp-fast;
> > dumper: stream_client: connected to 127.0.0.1.50083
> > dumper: stream_client: our side is 0.0.0.0.50084
> > dumper: try_socksize: send buffer size is 65536
> > taper: stream_accept: connection from 127.0.0.1.50084
> > taper: try_socksize: receive buffer size is 32768
> > driver: state time 9.018 free kps: 570 space: 0 taper: writing
> idle-dumpers: 1 qlen tapeq: 0 runq: 0 roomq: 0 wakeup: 86400 driver-idle:
> not-idle
> > driver: interface-state time 9.018 if : free 570
> > driver: hdisk-state time 9.018
> > dumper: stream_client: connected to 127.0.0.1.50097
> > dumper: stream_client: our side is 0.0.0.0.50085
> > dumper: stream_client: connected to 127.0.0.1.50098
> > dumper: stream_client: our side is 0.0.0.0.50086
> 
>    v
> > taper: writing end marker. [Norstead01 ERR kb 192 fm 1]
> > dumper: kill compress command
> > driver: result time 18.083 from dumper0: FAILED 01-00002 ["data write:
> Connection reset by peer"]
> > driver: result time 18.084 from taper: TAPE-ERROR 00-00001 [writing file:
> Input/output error]
>    ^
> 
> > dump of driver schedule before start degraded mode:
> > --------
> > --------
> > dump of driver schedule after start degraded mode:
> > --------
> > --------
> > driver: state time 18.084 free kps: 600 space: 0 taper: DOWN idle-dumpers:
> 2 qlen tapeq: 0 runq: 0 roomq: 0 wakeup: 86400 driver-idle: not-idle
> > driver: interface-state time 18.084 if : free 600
> > driver: hdisk-state time 18.084
> > driver: QUITTING time 18.084 telling children to quit
> > driver: send-cmd time 18.084 to dumper0: QUIT
> > driver: send-cmd time 18.084 to dumper1: QUIT
> > driver: send-cmd time 18.084 to taper: QUIT
> > taper: DONE [idle wait: 0.000 secs]
> > driver: FINISHED time 18.084
> > amdump: end at Fri Mar 11 15:27:17 GMT 2005
>  
> 
>  - - - - - - - - - /tmp/amanda/sendbackup.20050311152708.debug - - - - - - -
> - - 
> 
> > sendbackup: debug 1 pid 16019 ruid 34 euid 34: start at Fri Mar 11
> 15:27:08 2005
> > /usr/lib/amanda/sendbackup: version 2.4.4p2
> >   parsed request as: program `GNUTAR'
> >                      disk `alsike_boot'
> >                      device `/boot'
> >                      level 0
> >                      since 1970:1:1:0:0:0
> >                      options `|;bsd-auth;srvcomp-fast;'
> > sendbackup: try_socksize: send buffer size is 65536
> > sendbackup: time 0.000: stream_server: waiting for connection:
> 0.0.0.0.50097
> > sendbackup: time 0.000: stream_server: waiting for connection:
> 0.0.0.0.50098
> > sendbackup: time 0.001: waiting for connect on 50097, then 50098
> > sendbackup: time 0.001: stream_accept: connection from 127.0.0.1.50085
> > sendbackup: time 0.003: stream_accept: connection from 127.0.0.1.50086
> > sendbackup: time 0.003: got all connections
> > sendbackup-gnutar: time 0.003: doing level 0 dump as listed-incremental to
> /var/lib/amanda/gnutar-lists/alsikealsike_boot_0.new
> > sendbackup-gnutar: time 0.006: doing level 0 dump from date: 1970-01-01
> 0:00:00 GMT
> > sendbackup: time 0.007: spawning /usr/lib/amanda/runtar in pipeline
> > sendbackup: argument list: gtar --create --file - --directory /boot
> --one-file-system --listed-incremental
> /var/lib/amanda/gnutar-lists/alsikealsike_boot_0.new --sparse
> --ignore-failed-read --totals .
> > sendbackup-gnutar: time 0.122: /usr/lib/amanda/runtar: pid 16022
> 
>    v
> > sendbackup: time 9.062: error [/bin/tar got signal 13]
>    ^
> 
> > sendbackup: time 9.062: pid 16019 finish time Fri Mar 11 15:27:17 2005
> 
> 
> -- 
> 
> Norstead Farm - Bruce & Carole Skinner
> RR#1 Waterville NS Canada B0P 1V0
> Tel: 902-538-1765
> Fax: 902-538-1794
> <mailto:norstead AT ns.sympatico DOT ca>
> 
> 
> ************************************************************************
> This e-mail and any of its attachments may contain Exelon Corporation
> proprietary information, which is privileged, confidential, or subject 
> to copyright belonging to the Exelon Corporation family of Companies. 
> This e-mail is intended solely for the use of the individual or entity 
> to which it is addressed.  If you are not the intended recipient of this 
> e-mail, you are hereby notified that any dissemination, distribution, 
> copying, or action taken in relation to the contents of and attachments 
> to this e-mail is strictly prohibited and may be unlawful.  If you have 
> received this e-mail in error, please notify the sender immediately and 
> permanently delete the original and any copy of this e-mail and any 
> printout. Thank You.
> ************************************************************************
> 
> 

-- 

Bruce S. Skinner
Defence R&D Canada - Atlantic
9 Grove St.              <mailto:Bruce.Skinner AT drdc-rddc.gc DOT ca>
P.O. Box 1012            <http://www.drdc-rddc.dnd.ca>
Dartmouth  NS
CANADA                    tel: (902) 426-3100 x205
B2Y 3Z7                   fax: (902) 426-9654


<Prev in Thread] Current Thread [Next in Thread>