Re: [Bacula-users] Fatal error: backup.c:892 Network send error to SD. ERR=Connection reset by peer

2010-04-21 07:15:38
From: Matija Nalis <mnalis+bacula AT CARNet DOT hr>
To: Jon Schewe <jpschewe AT mtu DOT net>
Date: Wed, 21 Apr 2010 13:13:12 +0200
On Sun, Apr 18, 2010 at 11:46:33AM -0500, Jon Schewe wrote:
> > http://wiki.bacula.org/doku.php?id=faq#my_backup_starts_but_dies_after_a_while_with_connection_reset_by_peer_error
> >
> > [1] It actually tries that at one point in src/lib/bsock.c if
> >     TCP_KEEPIDLE support is detected, but it fails to detect it
> >     properly because <netinet/tcp.h> is not included.
> >
> >     However, even after fixing that (and the missing semicolon on the
> >     'int opt = heart_beat' line), it still doesn't look like it sets
> >     TCP_KEEPIDLE correctly on the FD->SD connection, so maybe this
> >     code path is not used there.
> >
> >     Anyway, I gave up debugging there and just set the system
> >     defaults. But I just thought I'd mention that in case someone
> >     else wants to continue chasing the bug.
> >
> >   
> Hmm, this sounds like a bug that should be fixed; once it is fixed, it
> may remove a bunch of problems with firewalls.

FYI, I've posted a patch to the bacula-devel mailing list that fixes
the existing TCP keepalive (TCP_KEEPIDLE) support. That support could
be extended (not all parts of bacula use that function), but it might
be enough.

If someone is willing to try it, let me (or better, the whole list)
know how it fares and whether it fixes the timeouts without the user
needing to resort to changing system defaults.
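
For anyone who wants to see what per-socket keepalive looks like outside
of bacula, here is a minimal sketch in plain C. It is not the patch and
not bacula code: enable_keepalive() and get_keepidle() are names made up
for illustration, and the 60-second value used further down is only an
example. The point is that <netinet/tcp.h> has to be included for
TCP_KEEPIDLE to be visible to the #ifdef at all, and that getsockopt()
can be used to check what the kernel actually accepted.

```c
#include <sys/types.h>
#include <sys/socket.h>
#include <netinet/in.h>
#include <netinet/tcp.h>   /* without this, TCP_KEEPIDLE is never defined */
#include <unistd.h>

/* Hypothetical helper (not a bacula function): turn on keepalive probes
 * and, where the platform supports it, set the idle time before the
 * first probe. Returns 0 on success, -1 on error. */
static int enable_keepalive(int fd, int idle_secs)
{
    int on = 1;

    if (setsockopt(fd, SOL_SOCKET, SO_KEEPALIVE, &on, sizeof(on)) < 0)
        return -1;

#ifdef TCP_KEEPIDLE
    int opt = idle_secs;
    if (setsockopt(fd, IPPROTO_TCP, TCP_KEEPIDLE, &opt, sizeof(opt)) < 0)
        return -1;
#else
    (void)idle_secs;       /* platform has no per-socket idle setting */
#endif
    return 0;
}

/* Hypothetical helper: read back the idle time in seconds, or -1 if it
 * cannot be read (error, or TCP_KEEPIDLE not available). */
static int get_keepidle(int fd)
{
#ifdef TCP_KEEPIDLE
    int val = 0;
    socklen_t len = sizeof(val);

    if (getsockopt(fd, IPPROTO_TCP, TCP_KEEPIDLE, &val, &len) < 0)
        return -1;
    return val;
#else
    (void)fd;
    return -1;
#endif
}
```

Calling enable_keepalive(fd, 60) on a TCP socket and then
get_keepidle(fd) should report 60 on Linux. If it reports the system
default instead, the setsockopt() call is not being reached on that
connection.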

_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users