Amanda-Users

Re: Backups fail on new client: too many dumper retry

2006-06-21 15:35:33
Subject: Re: Backups fail on new client: too many dumper retry
From: Jon LaBadie <jon AT jgcomp DOT com>
To: amanda-users AT amanda DOT org
Date: Wed, 21 Jun 2006 15:30:45 -0400
On Wed, Jun 21, 2006 at 01:07:40PM -0600, Charles Curley wrote:
> On Wed, Jun 21, 2006 at 02:06:28PM -0400, Jon LaBadie wrote:
> > On Wed, Jun 21, 2006 at 11:43:31AM -0600, Charles Curley wrote:
> > > On Wed, Jun 21, 2006 at 12:51:45PM -0400, Jon LaBadie wrote:
> > > 
> > > > runtar is called from other programs and they write debug info to
> > > > things like sendsize and sendbackup.  Anything interesting there?
> > > 
> > > Nothing that jumps out. Here's a recent sendbackup:
> > > --------------------------------------------------
> > > sendbackup: debug 1 pid 7550 ruid 33 euid 33: start at Wed Jun 21 11:40:12 2006
> > > /usr/lib/amanda/sendbackup: version 2.4.5p1
> > >   parsed request as: program `GNUTAR'
> > >                      disk `/etc'
> > >                      device `/etc'
> > >                      level 0
> > >                      since 1970:1:1:0:0:0
> > >                      options `|;bsd-auth;compress-fast;index;'
> > > sendbackup: try_socksize: send buffer size is 65536
> > > sendbackup: time 0.001: stream_server: waiting for connection: 0.0.0.0.44983
> > > sendbackup: time 0.001: stream_server: waiting for connection: 0.0.0.0.37243
> > > sendbackup: time 0.001: stream_server: waiting for connection: 0.0.0.0.50509
> > > sendbackup: time 0.001: waiting for connect on 44983, then 37243, then 50509
> > > sendbackup: time 30.002: stream_accept: timeout after 30 seconds
> > > sendbackup: time 30.002: timeout on data port 44983
> > > sendbackup: time 60.000: stream_accept: timeout after 30 seconds
> > > sendbackup: time 60.000: timeout on mesg port 37243
> > > sendbackup: time 90.000: stream_accept: timeout after 30 seconds
> > > sendbackup: time 90.001: timeout on index port 50509
> > > sendbackup: time 90.001: pid 7550 finish time Wed Jun 21 11:41:42 2006
> > > --------------------------------------------------
> > 
> > Uhhh, Nothing stands out?  Nothing like trying to set up the 3 needed
> > connections and not connecting, but timing out instead?
> > 
> 
> Nothing other than the problem I've already asked about & to which
> I've gotten no answer: Is the client trying to connect to
> 0.0.0.0.50509? And if so, where is that set up?

My networking is pretty weak, so hopefully someone else will jump in.

That log is from the server, correct?  What I think is happening is
that the server creates 3 sockets as shown, then asks the client to
do the same, attaching to the new sockets and perhaps receiving the
client's socket numbers in return.  Except that it is not getting any
response from the client.
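
For what it's worth, the pattern in the log (open three listening
sockets, then wait up to 30 seconds on each in turn) can be sketched
in a few lines of Python.  This is only an illustration of that
pattern, not Amanda's actual code; the function name and the stream
names are made up to match the log messages.  As far as I can tell,
0.0.0.0 is just the wildcard address meaning "listen on every local
interface", so 0.0.0.0.50509 would be a port being listened on, not an
address anyone is trying to connect to.

```python
import socket

def wait_for_streams(names=("data", "mesg", "index"), timeout=30.0):
    """Open one listening socket per stream, then wait on each in turn.

    Hypothetical helper mirroring the log output; not Amanda's code.
    """
    listeners = []
    for name in names:
        s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
        # 0.0.0.0 = all local interfaces; port 0 = let the kernel pick one
        s.bind(("0.0.0.0", 0))
        s.listen(1)
        s.settimeout(timeout)
        listeners.append((name, s))

    results = {}
    for name, s in listeners:
        port = s.getsockname()[1]
        try:
            conn, peer = s.accept()      # blocks until a peer connects
            conn.close()
            results[name] = f"connected from {peer[0]} on port {port}"
        except socket.timeout:           # nobody connected in time
            results[name] = f"timeout on {name} port {port}"
        finally:
            s.close()
    return results

if __name__ == "__main__":
    for line in wait_for_streams(timeout=2.0).values():
        print(line)
```

If nothing ever connects, each wait simply runs out, which is the
30/60/90 second sequence in the log.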

Whether the request is not sent properly, not received properly, or
refused with no acknowledgement, or whether the reply is not received
properly by the server, I don't know.

Perhaps a network traffic monitor like netcat/tcpdump/snoop might
pinpoint where the problem lies.
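
Short of a packet capture, a quick first check is whether a plain TCP
connection can be opened between the two machines at all.  Below is a
minimal sketch in Python; the function name is my own, and the host
and port in the example are placeholders, not values from your setup.
Note that the ports in the log look like they are chosen fresh for
each run, so a probe like this would have to be aimed at a port that
is known to be listening at the time.

```python
import socket

def can_connect(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds in time."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, unreachable host, and timeouts
        return False

if __name__ == "__main__":
    # "client.example.com" and 44983 are placeholders for illustration
    print(can_connect("client.example.com", 44983))
```

A False here while the other end is definitely listening would point
toward a firewall or packet filter between the two hosts, but that is
a guess to be confirmed with tcpdump or snoop.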

-- 
Jon H. LaBadie                  jon AT jgcomp DOT com
 JG Computing
 4455 Province Line Road        (609) 252-0159
 Princeton, NJ  08540-4322      (609) 683-7220 (fax)