Bacula-users

[Bacula-users] Problem with backup failing after 2 hours

2010-03-16 12:34:27
Subject: [Bacula-users] Problem with backup failing after 2 hours
From: Jerry Lowry <jlowry AT edt DOT com>
To: bacula-users AT lists.sourceforge DOT net
Date: Tue, 16 Mar 2010 09:32:15 -0700
Hi,
I have a new installation that I am tweaking the backups on.  One of my 
backups goes through the firewall from a public IP address to the 
backup server, which is on a private IP address.  The backup works just 
fine for the first 2h 11m 15s and then it fails.  After the first test 
failed I added the 'Heartbeat Interval' option and set it to 30 seconds.  
This slowed down the backup, but I wanted to make sure that it continued 
through the entire disk.  The disk that I am backing up has close to 
250GB of data on it.  I get approx. 50GB backed up before it fails.  Is 
there any other setting that will help this finish?
The logs are here:

First attempt:

15-Mar 10:26 distress-sd JobId 38: Labeled new Volume "hardware-0010" on device 
"FileStorage1" (/backup1).
15-Mar 10:26 distress-sd JobId 38: Wrote label to prelabeled Volume 
"hardware-0010" on device "FileStorage1" (/backup1)
15-Mar 12:37 distress-dir JobId 38: Fatal error: Network error with FD during 
Backup: ERR=Connection timed out
15-Mar 12:37 distress-sd JobId 38: JobId=38 
Job="BackupHardware.2010-03-15_10.26.42_06" marked to be canceled.
15-Mar 12:37 distress-sd JobId 38: Job write elapsed time = 02:11:15, Transfer 
rate = 8.120 M Bytes/second
15-Mar 12:37 distress-sd JobId 38: Error: bsock.c:529 Read expected 65536 got 
14596 from client:70.99.222.36:36643
15-Mar 12:37 distress-dir JobId 38: Fatal error: No Job status returned from FD.
15-Mar 12:37 distress-dir JobId 38: Error: Bacula distress-dir 5.0.1 (24Feb10): 
15-Mar-2010 12:37:59
  Build OS:               x86_64-unknown-linux-gnu redhat 
  JobId:                  38
  Job:                    BackupHardware.2010-03-15_10.26.42_06
  Backup Level:           Full (upgraded from Incremental)
  Client:                 "swift-fd" 5.0.1 (24Feb10) 
sparc-sun-solaris2.10,solaris,5.10
  FileSet:                "Swift Hardware Set" 2010-03-10 11:30:53
  Pool:                   "Pool1" (From Job resource)
  Catalog:                "MyCatalog" (From Client resource)
  Storage:                "File1" (From command line)
  Scheduled time:         15-Mar-2010 10:26:34
  Start time:             15-Mar-2010 10:26:44
  End time:               15-Mar-2010 12:37:59
  Elapsed time:           2 hours 11 mins 15 secs
  Priority:               10
  FD Files Written:       0
  SD Files Written:       707,122
  FD Bytes Written:       0 (0 B)
  SD Bytes Written:       63,952,326,117 (63.95 GB)
  Rate:                   0.0 KB/s
  Software Compression:   None
  VSS:                    no
  Encryption:             no
  Accurate:               no
  Volume name(s):         hardware-0010
  Volume Session Id:      1
  Volume Session Time:    1268673669
  Last Volume Bytes:      64,023,383,638 (64.02 GB)
  Non-fatal FD errors:    0
  SD Errors:              1
  FD termination status:  Error
  SD termination status:  Canceled
  Termination:            *** Backup Error ***
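
As a quick sanity check on the report above, the transfer rate and a rough time for the whole disk can be worked out from the logged figures. This is only a sketch: the byte count and timestamps are taken from the JobId 38 report, while the 250 GB total is the approximate figure from the message and the assumption of a constant rate is a simplification.

```python
from datetime import datetime

# Values copied from the JobId 38 report.
start = datetime(2010, 3, 15, 10, 26, 44)   # Start time
end = datetime(2010, 3, 15, 12, 37, 59)     # End time
sd_bytes = 63_952_326_117                   # SD Bytes Written

elapsed_s = (end - start).total_seconds()
rate_mb_s = sd_bytes / elapsed_s / 1e6      # decimal megabytes per second

# Assumption: about 250 GB on the disk, transferred at a constant rate.
total_bytes = 250e9
eta_hours = total_bytes / (sd_bytes / elapsed_s) / 3600

print(f"elapsed: {elapsed_s:.0f} s")
print(f"rate:    {rate_mb_s:.2f} MB/s")
print(f"full backup at this rate: about {eta_hours:.1f} hours")
```

At this rate the full disk would need several times longer than the point at which the job currently fails.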

Second attempt:

15-Mar 17:14 distress-dir JobId 40: Fatal error: Network error with FD during 
Backup: ERR=Connection timed out
15-Mar 17:14 distress-sd JobId 40: JobId=40 
Job="BackupHardware.2010-03-15_15.02.58_10" marked to be canceled.
15-Mar 17:14 distress-sd JobId 40: Job write elapsed time = 01:57:35, Transfer 
rate = 7.432 M Bytes/second
15-Mar 17:14 distress-dir JobId 40: Fatal error: No Job status returned from FD.
15-Mar 17:14 distress-dir JobId 40: Error: Bacula distress-dir 5.0.1 (24Feb10): 
15-Mar-2010 17:14:15
  Build OS:               x86_64-unknown-linux-gnu redhat 
  JobId:                  40
  Job:                    BackupHardware.2010-03-15_15.02.58_10
  Backup Level:           Full (upgraded from Incremental)
  Client:                 "swift-fd" 5.0.1 (24Feb10) 
sparc-sun-solaris2.10,solaris,5.10
  FileSet:                "Swift Hardware Set" 2010-03-10 11:30:53
  Pool:                   "Pool1" (From Job resource)
  Catalog:                "MyCatalog" (From Client resource)
  Storage:                "File1" (From command line)
  Scheduled time:         15-Mar-2010 15:02:57
  Start time:             15-Mar-2010 15:03:00
  End time:               15-Mar-2010 17:14:15
  Elapsed time:           2 hours 11 mins 15 secs
  Priority:               10
  FD Files Written:       0
  SD Files Written:       0
  FD Bytes Written:       0 (0 B)
  SD Bytes Written:       0 (0 B)
  Rate:                   0.0 KB/s
  Software Compression:   None
  VSS:                    no
  Encryption:             no
  Accurate:               no
  Volume name(s):         hardware-0012
  Volume Session Id:      3
  Volume Session Time:    1268673669
  Last Volume Bytes:      51,996,669,624 (51.99 GB)
  Non-fatal FD errors:    0
  SD Errors:              0
  FD termination status:  Error
  SD termination status:  Running
  Termination:            *** Backup Error ***
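
Both reports show exactly the same elapsed time, which may be worth checking against a fixed timer rather than the amount of data. The sketch below converts the two logged start and end times to seconds and compares them with the default Linux TCP keepalive timing (7200 s idle, then 9 probes 75 s apart). That the Director host actually uses these defaults is an assumption, not something the logs confirm, so this is a hint rather than a diagnosis.

```python
from datetime import datetime

def elapsed_seconds(start, end):
    """Whole seconds between two datetimes."""
    return int((end - start).total_seconds())

# Start and end times copied from the two job reports.
job38 = elapsed_seconds(datetime(2010, 3, 15, 10, 26, 44),
                        datetime(2010, 3, 15, 12, 37, 59))
job40 = elapsed_seconds(datetime(2010, 3, 15, 15, 3, 0),
                        datetime(2010, 3, 15, 17, 14, 15))

# Assumption: default Linux keepalive settings on the Director host
# (net.ipv4.tcp_keepalive_time, tcp_keepalive_intvl, tcp_keepalive_probes).
KEEPALIVE_TIME = 7200
KEEPALIVE_INTVL = 75
KEEPALIVE_PROBES = 9
keepalive_total = KEEPALIVE_TIME + KEEPALIVE_INTVL * KEEPALIVE_PROBES

print("JobId 38 elapsed:", job38, "s")
print("JobId 40 elapsed:", job40, "s")
print("keepalive idle time plus all probes:", keepalive_total, "s")
print("match:", job38 == job40 == keepalive_total)
```

If the numbers line up, one possible reading is that an idle connection (for example the Director's connection to the FD) was dropped by the firewall and only noticed once keepalive probing gave up.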

thanks,
-- 

---------------------------------------------------------------------------
Jerold Lowry
IT Manager / Software Engineer
Engineering Design Team (EDT), Inc. a HEICO company
1400 NW Compton Drive, Suite 315
Beaverton, Oregon 97006 (U.S.A.)
Phone: 503-690-1234 / 800-435-4320
Fax: 503-690-1243
Web: www.edt.com <http://www.edt.com/>

 


