Bacula-users

Re: [Bacula-users] Network error with FD during Backup: ERR=Connection reset by peer

2012-09-27 03:33:59
Subject: Re: [Bacula-users] Network error with FD during Backup: ERR=Connection reset by peer
From: "Michael Neuendorf" <michael.neuendorf AT novanetgmbh DOT de>
To: <bacula-users AT lists.sourceforge DOT net>
Date: Thu, 27 Sep 2012 09:31:25 +0200
> > 2012-09-19 22:58:45   bacula-dir JobId 13962: Start Backup JobId
13962,
> Job=nina_systemstate.2012-09-19_21.50.01_31
> > 2012-09-19 22:58:46   bacula-dir JobId 13962: Using Device
> "FileStorageLocal"
> > 2012-09-19 23:02:41   nina-fd JobId 13962: DIR and FD clocks differ
by 233
> seconds, FD automatically compensating.
> > 2012-09-19 23:02:41   nina-fd JobId 13962: DIR and FD clocks differ
by 233
> seconds, FD automatically compensating.
> > 2012-09-19 23:02:45   nina-fd JobId 13962: shell command: run
> ClientRunBeforeJob "C:/backup/bacula/systemstate.cmd"
> > 2012-09-19 23:02:45   nina-fd JobId 13962: shell command: run
> ClientRunBeforeJob "C:/backup/bacula/systemstate.cmd"
> > 2012-09-19 23:03:40   bacula-dir JobId 13962: Sending Accurate
information.
> > 2012-09-19 23:05:12   bacula-dir-sd JobId 13962: Job write elapsed
time =
> 00:01:21, Transfer rate = 2.517 M Bytes/second
> > 2012-09-19 23:09:06   nina-fd JobId 13962: shell command: run
> ClientAfterJob "C:/backup/bacula/systemstate.cmd cleanup"
> > 2012-09-19 23:09:06   nina-fd JobId 13962: shell command: run
> ClientAfterJob "C:/backup/bacula/systemstate.cmd cleanup"
> > 2012-09-19 23:05:17   bacula-dir JobId 13962: Fatal error: Network
error with
> FD during Backup: ERR=Connection reset by peer
> 
> We have seen that same error ("Connection reset by peer") ocassionally
for
> many months.  Some are normal - Mac/Windows desktops/laptops that
> either get rebooted or removed from the network during a backup, etc.
> But sometimes we see this error with UNIX servers that are up 24x7.
We
> suspect that it is network related since we've had similar errors with
print
> servers and non-Bacula backup servers.  But we have yet to pin it
down.  We
> restart failed jobs in Bacula so typically the job always completes OK
even
> after initially getting this error on the first try.
> I'd be curious to know if others get these errors occasionally and
what
> version of Bacula that you're running.

I did a couple of installations and I never faced with this error
before. Anyway, never say never again.
 In the first scenario we were backing up to  tape for a few years and
then migrated to a disc based solution. Everything worked like a charm.
This particular problem occurred first, when we migrated the "problem
server" from a physical machine to a virtualized one (with VMware
converter). As I mentioned in the reply to Josh, there is another
virtual server on this host without any problems.

Has anyone probably issues with nic drivers, too. I used a mix of E1000
or "flexible" in the vm config.

However, can someone tell me, where the problem has its origin. Is it
the FD, SD or the Dir? It's not clear for me.

Tom: How did you restart the job. Did you have a script or do you do it
by hand?

Regards
Michael
 

 
NovaNet GmbH
Kupferstr. 65
44532 Lunen
Telefon: 02306/202100
FAX: 02306/202109
WEB: www.novanetgmbh.de
Firmensitz: Lunen
Amtsgericht Dortmund HRB 17273
USt-ID DE 124793480, St.-Nr. 316/5759/0318
Geschaftsfuhrerin: Dipl. Informatikerin (FH) Desiree Wunsche

------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://ad.doubleclick.net/clk;258768047;13503038;j?
http://info.appdynamics.com/FreeJavaPerformanceDownload.html
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users