Bacula-users

Re: [Bacula-users] Fatal error: backup.c:892 Network send error to SD. ERR=Connection reset by peer

2010-04-12 10:42:55
Subject: Re: [Bacula-users] Fatal error: backup.c:892 Network send error to SD. ERR=Connection reset by peer
From: Matija Nalis <mnalis+bacula AT CARNet DOT hr>
To: Jon Schewe <jpschewe AT mtu DOT net>
Date: Mon, 12 Apr 2010 16:40:36 +0200
On Mon, Apr 12, 2010 at 09:23:51AM -0500, Jon Schewe wrote:
> On 4/12/10 9:00 AM, Matija Nalis wrote:
> > (SO_KEEPALIVE will work even with only one side of connection having
> > it enabled).
> >   
> So I should only need the heartbeat on that client's setup as well,
> right? Getting rid of extra heart beats would be nice.

Yes, it should be enough. Note that there is no real need to get rid
of extra heartbeats, they are not really expensive (so biggest gain
is "cleaner" config files).

> > Good, let us know how it fares.
> >   
> It seems to be running, but I've run into a problem with bconsole. Once
> I started the job, if I run bconsole and then "status dir", the console
> hangs. If I strace the bconsole process it's stuck in a select call.
>
> >strace -p 18452
> Process 18452 attached - interrupt to quit
> select(4, [3], NULL, NULL, {9, 461287}) = 0 (Timeout)
> read(3, 0x655d80, 5)                    = -1 EAGAIN (Resource
> temporarily unavailable)

That should not be related to SO_KEEPALIVE - it should be completly
transparent to the applications if the network is working (and even
when it is not working, it should differ only in always terminating
the connection instead of sometimes terminating connection and
sometimes hanging idefinitely).

Anyway, it may be few issues with directory hanging. Most common is
you are too eager. For example, is SQL server is busy, "status dir"
will hang until it completes.

It is especially problem with bigger databases and MySQL instead of
PostgreSQL, see http://bugs.bacula.org/view.php?id=1472, where it can
take even several hours! (note that while it talks about "restore"
speed, it is also related to accurate backups which employ similar
SQL queries)

You can check for this with "show processlist" in MySQL (if you are
running MySQL for database, of course) if that is the case (or simply
wait).

Or you might be unlucky enough to hit a real director bug in 5.0.1,
see http://bugs.bacula.org/view.php?id=1528, but that is unlikely.


------------------------------------------------------------------------------
Download Intel&#174; Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs
proactively, and fine-tune applications for parallel performance.
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users

<Prev in Thread] Current Thread [Next in Thread>