BackupPC-users

[BackupPC-users] The dread "Unable to read 4 bytes" / "Read EOF: Connection reset by peer"

2010-05-10 21:43:51
Subject: [BackupPC-users] The dread "Unable to read 4 bytes" / "Read EOF: Connection reset by peer"
From: Nick Bright <oz AT valnet DOT net>
To: "General list for user discussion, questions and support" <backuppc-users AT lists.sourceforge DOT net>
Date: Mon, 10 May 2010 20:14:54 -0500
Let me start off by saying this: I know what I'm doing.

I've been running backuppc for about two years on two servers, backing 
up about 25 servers (mix of windows and linux).

This is the ONLY machine I've ever had this problem with that wasn't SSH 
authentication problems, and what's worse is that it worked for almost a 
year before it stopped working. I'm convinced it's something about the 
shell or environment of the client system, but I've been trying to 
figure this out since last *November* and I'm just not getting anywhere 
with it. Every single hit just says "your SSH keys are messed up", but 
they most certainly are /not/ messed up, as evidenced below.

All of the configuration is the same as my other linux servers. I can 
find absolutely nothing preventing this from working, but it fails every 
time!

Please. I'm _BEGGING_ for something here, I have NO idea where else to 
look! There is unfortunately no extended troubleshooting information 
available in the documentation located at 
http://sourceforge.net/apps/mediawiki/backuppc/index.php?title=ErrorMessages 
or http://backuppc.sourceforge.net/faq/ssh.html or 
http://backuppc.sourceforge.net/faq/BackupPC.html#step_5

The target machine is CentOS5 running cPanel, and I've updated rsync to 
rsync-3.0.7-1.el5.rf (from RPM Forge, had to do that on another box as 
well).

What else can I do to find extended debugging information?

The xferlog always says:

full backup started for directory / (baseline backup #259)
Running: /usr/bin/ssh -q -x -l root TargetServer /usr/bin/rsync --server 
--sender --numeric-ids --perms --owner --group -D --links --hard-links 
--times --block-size=2048 --recursive --ignore-times . /
Xfer PIDs are now 2924
Read EOF: Connection reset by peer
Tried again: got 0 bytes
Done: 0 files, 0 bytes
Got fatal error during xfer (Unable to read 4 bytes)
Backup aborted (Unable to read 4 bytes)
Not saving this as a partial backup since it has fewer files than the 
prior one (got 0 and 0 files versus 0)

Yet, from a shell on the BackupPC server, as the backuppc user, I have 
no problems SSHing to the target:

[backuppc@BackupPCServer ~]$ ssh root@TargetServer
Last login: Mon May 10 19:36:24 2010 from 192.168.50.1
-bash-3.2#
[backuppc@BackupPCServer ~]$ ssh root@TargetServer whoami
root
[backuppc@BackupPCServer ~]$ ssh -q -x -l root TargetServer
Last login: Mon May 10 19:42:03 2010 from 192.168.50.1
-bash-3.2#

[backuppc@BackupPCServer ~]$ /usr/local/backuppc/bin/BackupPC_dump -v 
TargetServer
cmdSystemOrEval: about to system /bin/ping -c 1 -w 3 TargetServer
cmdSystemOrEval: finished: got output PING TargetServer (192.168.50.2) 
56(84) bytes of data.
64 bytes from TargetServer (192.168.50.2): icmp_seq=1 ttl=64 time=0.133 ms

--- TargetServer ping statistics ---
1 packets transmitted, 1 received, 0% packet loss, time 0ms
rtt min/avg/max/mdev = 0.133/0.133/0.133/0.000 ms

cmdSystemOrEval: about to system /bin/ping -c 1 -w 3 TargetServer
cmdSystemOrEval: finished: got output PING TargetServer (192.168.50.2) 
56(84) bytes of data.
64 bytes from TargetServer (192.168.50.2): icmp_seq=1 ttl=64 time=0.212 ms

--- TargetServer ping statistics ---
1 packets transmitted, 1 received, 0% packet loss, time 0ms
rtt min/avg/max/mdev = 0.212/0.212/0.212/0.000 ms

CheckHostAlive: returning 0.212
full backup started for directory / (baseline backup #259)
started full dump, share=/
Running: /usr/bin/ssh -q -x -l root TargetServer /usr/bin/rsync --server 
--sender --numeric-ids --perms --owner --group -D --links --hard-links 
--times --block-size=2048 --recursive --port=38271 --ignore-times . /
Xfer PIDs are now 3040
xferPids 3040
Read EOF: Connection reset by peer
Tried again: got 0 bytes
Done: 0 files, 0 bytes
Got fatal error during xfer (Unable to read 4 bytes)
cmdSystemOrEval: about to system /bin/ping -c 1 -w 3 TargetServer
cmdSystemOrEval: finished: got output PING TargetServer (192.168.50.2) 
56(84) bytes of data.
64 bytes from TargetServer (192.168.50.2): icmp_seq=1 ttl=64 time=0.167 ms

--- TargetServer ping statistics ---
1 packets transmitted, 1 received, 0% packet loss, time 0ms
rtt min/avg/max/mdev = 0.167/0.167/0.167/0.000 ms

cmdSystemOrEval: about to system /bin/ping -c 1 -w 3 TargetServer
cmdSystemOrEval: finished: got output PING TargetServer (192.168.50.2) 
56(84) bytes of data.
64 bytes from TargetServer (192.168.50.2): icmp_seq=1 ttl=64 time=0.165 ms

--- TargetServer ping statistics ---
1 packets transmitted, 1 received, 0% packet loss, time 0ms
rtt min/avg/max/mdev = 0.165/0.165/0.165/0.000 ms

CheckHostAlive: returning 0.165
Backup aborted (Unable to read 4 bytes)
Not saving this as a partial backup since it has fewer files than the 
prior one (got 0 and 0 files versus 0)
dump failed: Unable to read 4 bytes

Any help or tips are greatly appreciated.

Thank you!

   - Nick

------------------------------------------------------------------------------

_______________________________________________
BackupPC-users mailing list
BackupPC-users AT lists.sourceforge DOT net
List:    https://lists.sourceforge.net/lists/listinfo/backuppc-users
Wiki:    http://backuppc.wiki.sourceforge.net
Project: http://backuppc.sourceforge.net/