[Bacula-users] "Connection reset by peer" since upgrade from 2.2.x to 2.4.4

2009-03-10 04:10:50
From: Thomas Mueller <thomas AT chaschperli DOT ch>
To: bacula-users AT lists.sourceforge DOT net
Date: Tue, 10 Mar 2009 07:55:33 +0000 (UTC)
hi

since i upgraded bacula from 2.2.x to 2.4.4 (both backports.org 
versions), the full backup (about 3TB) fails with "Connection reset by peer". 
dir, sd and fd are on the same network, connected to the same switch, so 
there is no NAT in between. with 2.2.x, even 4TB backups were no problem. 

after searching the web, i found many references to "Heartbeat 
Interval", so i turned it on for dir, sd and fd. 

i've started the job 3 times. the first 2 runs, without "Heartbeat 
Interval", failed after 8h and 11h. the last one, with "Heartbeat 
Interval = 1m", failed after 11h. 
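
for reference, a minimal sketch of what enabling the heartbeat on all three daemons could look like. this is not a copy of the actual configs: the resource names are taken from the job log in this message, every other directive is omitted, and whether the Director resource accepts the directive in 2.4.4 is an assumption worth checking against the manual for that version.

```
# bacula-fd.conf (on the filer) -- sketch only, other directives omitted
FileDaemon {
  Name = filer-fd
  # unit spelled out on purpose, see the note below
  Heartbeat Interval = 60 seconds
}

# bacula-sd.conf (on the backup host)
Storage {
  Name = backup-sd
  Heartbeat Interval = 60 seconds
}

# bacula-dir.conf -- assumption: this version accepts the directive here
Director {
  Name = filer-dir
  Heartbeat Interval = 60 seconds
}
```

the unit is spelled out because, as far as i recall from the manual's section on time intervals, a bare "m" is read as months rather than minutes, so "1m" may not mean one minute. the daemons need a restart to pick up the change.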

joblog shows this:

09-Mar 21:26 filer-dir JobId 2396: Start Backup JobId 2396, Job=filer-betsch.2009-03-09_21.26.09.06
09-Mar 21:26 filer-dir JobId 2396: Using Device "FileStorage"
09-Mar 21:26 filer-fd JobId 2396: ClientBeforeJob: run command "/etc/bacula/scripts/filer-pre.sh filer-betsch.2009-03-09_21"
09-Mar 21:26 backup-sd JobId 2396: Wrote label to prelabeled Volume "Full-0002" on device "FileStorage" (/srv/backup/vtl)
10-Mar 08:05 filer-fd JobId 2396: Fatal error: backup.c:892 Network send error to SD. ERR=Connection reset by peer
10-Mar 08:05 filer-dir JobId 2396: Error: Bacula filer-dir 2.4.4 (28Dec08): 10-Mar-2009 08:05:27
  Build OS:               x86_64-pc-linux-gnu debian 4.0
  JobId:                  2396
  Job:                    filer-betsch.2009-03-09_21.26.09.06
  Backup Level:           Full
  Client:                 "filer-fd" 2.4.4 (28Dec08) x86_64-pc-linux-gnu,debian,4.0
  FileSet:                "filer-betsch-fs" 2008-07-04 17:21:40
  Pool:                   "Full" (From User input)
  Storage:                "File" (From Job resource)
  Scheduled time:         09-Mar-2009 21:26:09
  Start time:             09-Mar-2009 21:26:11
  End time:               10-Mar-2009 08:05:27
  Elapsed time:           10 hours 39 mins 16 secs
  Priority:               10
  FD Files Written:       1,559,995
  SD Files Written:       0
  FD Bytes Written:       2,087,925,378,284 (2.087 TB)
  SD Bytes Written:       0 (0 B)
  Rate:                   54435.4 KB/s
  Software Compression:   None
  VSS:                    no
  Storage Encryption:     no
  Volume name(s):         Full-0002
  Volume Session Id:      1
  Volume Session Time:    1236630215
  Last Volume Bytes:      2,088,866,268,781 (2.088 TB)
  Non-fatal FD errors:    0
  SD Errors:              0
  FD termination status:  Error
  SD termination status:  Error
  Termination:            *** Backup Error ***

both the filer and the backup host run debian etch with the 2.6.26 
kernel from backports.org. 

any hints on how to track down the problem? has anyone had the same problem?

- Thomas



