Bacula-users

[Bacula-users] Rif: "Connection reset by peer" since upgrade from 2.2.x to 2.4.4

2009-03-10 07:08:24
Subject: [Bacula-users] Rif: "Connection reset by peer" since upgrade from 2.2.x to 2.4.4
From: Ferdinando Pasqualetti <fpasqual AT ccci DOT it>
To: bacula-users AT lists.sourceforge DOT net
Date: Tue, 10 Mar 2009 11:14:31 +0100

Hi Thomas,
are you sure your SD filesystem supports file sizes more then 2 TB? If this is the probllem  you should set the MaximumVolumeSize in the pool.

--------------------------------------------------------------------------
Ferdinando Pasqualetti
G.T.Dati srl
Tel. 0557310862 - 3356172731 - Fax 055720143





Thomas Mueller <thomas AT chaschperli DOT ch>

10/03/2009 08.55

Per
bacula-users AT lists.sourceforge DOT net
CC
Oggetto
[Bacula-users] "Connection reset by peer" since upgrade from 2.2.x        to 2.4.4





hi

since i've upgraded bacula from 2.2.x to 2.4.4 (both backport.org
versions) the full backup (about 3TB) gets a "Connection reset by peer".
dir, sd and fd are on the same network, connected to the same switch - so
no NAT inbetween. with 2.2.x also 4TB backups were no problem.

after searching the web, i've found many references to "heartbeat
interval". turned on on dir, sd and fd.

i've started the job 3 times, 2 times without "Heartbeat interval": they
we're failing after 8h and 11h. the last one with "Heartbeat interval =
1m" failed after 11h.

joblog shows this:

09-Mar 21:26 filer-dir JobId 2396: Start Backup JobId 2396, Job=filer-
betsch.2009-03-09_21.26.09.06
09-Mar 21:26 filer-dir JobId 2396: Using Device "FileStorage"
09-Mar 21:26 filer-fd JobId 2396: ClientBeforeJob: run command "/etc/
bacula/scripts/filer-pre.sh filer-betsch.2009-03-09_21"
09-Mar 21:26 backup-sd JobId 2396: Wrote label to prelabeled Volume
"Full-0002" on device "FileStorage" (/srv/backup/vtl)
10-Mar 08:05 filer-fd JobId 2396: Fatal error: backup.c:892 Network send
error to SD. ERR=Connection reset by peer
10-Mar 08:05 filer-dir JobId 2396: Error: Bacula filer-dir 2.4.4
(28Dec08): 10-Mar-2009 08:05:27
 Build OS:               x86_64-pc-linux-gnu debian 4.0
 JobId:                  2396
 Job:                    filer-betsch.2009-03-09_21.26.09.06
 Backup Level:           Full
 Client:                 "filer-fd" 2.4.4 (28Dec08) x86_64-pc-linux-
gnu,debian,4.0
 FileSet:                "filer-betsch-fs" 2008-07-04 17:21:40
 Pool:                   "Full" (From User input)
 Storage:                "File" (From Job resource)
 Scheduled time:         09-Mar-2009 21:26:09
 Start time:             09-Mar-2009 21:26:11
 End time:               10-Mar-2009 08:05:27
 Elapsed time:           10 hours 39 mins 16 secs
 Priority:               10
 FD Files Written:       1,559,995
 SD Files Written:       0
 FD Bytes Written:       2,087,925,378,284 (2.087 TB)
 SD Bytes Written:       0 (0 B)
 Rate:                   54435.4 KB/s
 Software Compression:   None
 VSS:                    no
 Storage Encryption:     no
 Volume name(s):         Full-0002
 Volume Session Id:      1
 Volume Session Time:    1236630215
 Last Volume Bytes:      2,088,866,268,781 (2.088 TB)
 Non-fatal FD errors:    0
 SD Errors:              0
 FD termination status:  Error
 SD termination status:  Error
 Termination:            *** Backup Error ***

both filer and backup hosts are debian etch with 2.6.26 (backports.org)
kernel.

any hints how to track down the problem? anyone had the same problem?

- Thomas




------------------------------------------------------------------------------
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users

------------------------------------------------------------------------------
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users
<Prev in Thread] Current Thread [Next in Thread>