Bacula-users

[Bacula-users] Backup finished, but "Fatal error: Network error with FD"/"Connection reset by peer"

2015-08-03 07:14:51
Subject: [Bacula-users] Backup finished, but "Fatal error: Network error with FD"/"Connection reset by peer"
From: Raimund Sacherer <rs AT logitravel DOT com>
To: "Bacula-users AT lists.sourceforge DOT net" <bacula-users AT lists.sourceforge DOT net>
Date: Mon, 3 Aug 2015 12:52:43 +0200 (CEST)
Hello, 

we use bacula now for about 6 years and it works great. Some 6 month ago we 
switched to another Bacula Server. The switch included changing the OS from 
Linux to FreeBSD (to get better flexibility with ZFS, etc.).

Since the move we experience some problems. I keep logs for about 3 month so 
right now I can say that for about 3100 Backup Jobs 11 Jobs fail with:

02-Aug 09:13 backupserver-dir JobId 110334: Fatal error: Network error with FD 
during Backup: ERR=Connection reset by peer
02-Aug 09:13 backupserver-dir JobId 110334: Error: Bacula backupserver-dir 
5.2.12 (12Sep12):

But it seems the backup has finished just fine. After changing the the volume 
status from Error to Used we can restore files. 

It seems that after the backup is finished, a communication attempt between the 
director and the client fails somehow. 

All those clients are in the same LAN Network. The backup time is comparable, 
also the amount of files etc. 

It *seems* to only affect Windows, but I can not verify this fact as I do not 
have logs beyond 3 month. 

I read some-where that there could be problems with some sort of timeout in the 
FreeBSD network stack, but before twiddling with some knobs I really would 
appreciate if someone else had similar problems in the past and knows what the 
root cause is. 


Here an example of one failed (02. Aug) and two success backups from the same 
job over the last 3 weeks:


ERROR:
02-Aug 09:13 backupserver-dir JobId 110334: Fatal error: Network error with FD 
during Backup: ERR=Connection reset by peer
02-Aug 09:13 backupserver-dir JobId 110334: Error: Bacula backupserver-dir 
5.2.12 (12Sep12):
  Scheduled time:         01-Aug-2015 15:03:01
  Start time:             01-Aug-2015 22:08:09
  End time:               02-Aug-2015 09:13:31
  Elapsed time:           11 hours 5 mins 22 secs
  Priority:               12
  FD Files Written:       547,868
  SD Files Written:       547,868
  FD Bytes Written:       1,202,478,052,857 (1.202 TB)
  SD Bytes Written:       1,202,608,809,023 (1.202 TB)
  Rate:                   30120.7 KB/s


OK:
  Scheduled time:         25-Jul-2015 15:03:00
  Start time:             25-Jul-2015 21:52:17
  End time:               26-Jul-2015 08:24:29
  Elapsed time:           10 hours 32 mins 12 secs
  Priority:               12
  FD Files Written:       545,740
  SD Files Written:       545,740
  FD Bytes Written:       1,193,323,311,780 (1.193 TB)
  SD Bytes Written:       1,193,453,558,522 (1.193 TB)
  Rate:                   31459.5 KB/s


OK:
  Scheduled time:         18-Jul-2015 15:03:00
  Start time:             18-Jul-2015 20:26:21
  End time:               19-Jul-2015 05:52:25
  Elapsed time:           9 hours 26 mins 4 secs
  Priority:               12
  FD Files Written:       543,122
  SD Files Written:       543,122
  FD Bytes Written:       1,176,812,345,702 (1.176 TB)
  SD Bytes Written:       1,176,941,989,504 (1.176 TB)
  Rate:                   34648.8 KB/s



Thank you,
Best regards
Ray


------------------------------------------------------------------------------
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users