Bacula-users

[Bacula-users] FD not connecting to SD

2011-03-18 04:05:04
Subject: [Bacula-users] FD not connecting to SD
From: Lawrence Strydom <lawrence AT itcallcentre.co DOT za>
To: bacula-users AT lists.sourceforge DOT net
Date: Fri, 18 Mar 2011 10:00:59 +0200
Hi List,

I have been tasked to test bacula as a possible backup solution for my organisation and I configured a simple lab using two VMWare Server V1.0.6 instances, one as the bacula server and one as the client on my WinXP test machine.
The VMWare servers both run openSuSE11.3.

I used "Host Only" networking where the two Vmware servers share a private network with the host.

VM Bacula server running Bacula 5.0.3 compiled with the following configure options;

--enable-smartalloc --sbindir=/usr/local/bacula/bin --sysconfdir=/usr/local/bacula/bin -with-mysql -with-openssl -enable-bat -sysconfdir=/etc/bacula -enable-tray-monitor


VM Bacula client running Bacula 5.0.3 compiled with
 --enable-client-only
configure option.

bacula-dir.conf on VM Bacula server:
###################################################################################################
Director {                           
  Name = backup01-dir
  DIRport = 9101             
  QueryFile = "/etc/bacula/query.sql"
  WorkingDirectory = "/var/bacula/working"
  PidDirectory = "/var/run"
  Maximum Concurrent Jobs = 1
  Password = "NyYEic1bpHXLQaIRA6bgcaMsAXUWDmRmAxzV6dR+A3G/"        
  Messages = Daemon
}


JobDefs {
  Name = "DefaultJob"
  Type = Backup
  Level = Incremental
  Client = backup01-fd
  FileSet = "Full Set"
  Schedule = "WeeklyCycle"
  Accurate = Yes
  Messages = Standard
  Pool = Full-pool
  Priority = 10
  Write Bootstrap = "/var/bacula/working/%c.bsr"
}

Job {
  Name = "Backup01_selfbackup"
  JobDefs = "DefaultJob"
}

Job {
  Name = "BackupCatalog"
  JobDefs = "DefaultJob"
  Level = Full
  FileSet="Catalog"
  Schedule = "WeeklyCycleAfterBackup"
   RunBeforeJob = "/etc/bacula/make_catalog_backup.pl MyCatalog"
   RunAfterJob  = "/etc/bacula/delete_catalog_backup"
  Write Bootstrap = "/var/bacula/working/%n.bsr"
  Priority = 11                  
}

Job {
  Name = "RestoreFiles"
  Type = Restore
  Client=backup01-fd
  FileSet="Full Set"
  Storage = Full
  Pool = Default
  Messages = Standard
  Where = /tmp/bacula-restores
}

FileSet {
  Name = "Full Set"
  Include {
    Options {
      signature = MD5
    }

    File = /usr/local/bacula/bin
    File = /etc/bacula
  }

  Exclude {
    File = /var/bacula/working
    File = /tmp
    File = /proc
    File = /tmp
    File = /.journal
    File = /.fsck
  }
}

Schedule {
  Name = "WeeklyCycle"
  Run = Level=Full FullPool=Full-pool Storage=Full tue at 08:05
  Run = Level=Incremental IncrementalPool=Inc-pool Storage=Incr mon at 08:05
  Run = Level=Incremental IncrementalPool=Inc-pool Storage=Incr tue-fri at 08:05
}

Schedule {
  Name = "WeeklyCycleAfterBackup"
  Run = Full sun-sat at 23:10
}

FileSet {
  Name = "Catalog"
  Include {
    Options {
      signature = MD5
    }
    File = "/var/bacula/working/bacula.sql"
  }
}

Client {
  Name = backup01-fd
  Address = localhost
  FDPort = 9102
  Catalog = MyCatalog
  Password = "BhPuYwB0ytlQPAjPe086Qi/zThuG9oaJNhe+KrxoDUAO"          # password for FileDaemon
  File Retention = 30 days            # 30 days
  Job Retention = 6 months            # six months
  AutoPrune = yes                     # Prune expired Jobs/Files
}

Storage {
  Name = Full
# Do not use "localhost" here
  Address = localhost                # N.B. Use a fully qualified name here
  SDPort = 9103
  Password = "khnz/SHLgz8TfygpeMsUuEf00NinIsDDFQOUBtJ8KCLj"
  Device = Full
  Media Type = File
}

Storage {
  Name = Incr
# Do not use "localhost" here
  Address = localhost                # N.B. Use a fully qualified name here
  SDPort = 9103
  Password = "khnz/SHLgz8TfygpeMsUuEf00NinIsDDFQOUBtJ8KCLj"
  Device = Incr
  Media Type = File
}

Catalog {
  Name = MyCatalog
  dbname = "bacula"; dbuser = "bacula"; dbpassword = ""
}

Messages {
  Name = Standard
  append = "/var/bacula/working/log" = all, !skipped
  catalog = all
}

Messages {
  Name = Daemon
  mailcommand = "/usr/local/bacula/bin/bsmtp -h localhost -f \"\(Bacula\) \<%r\>\" -s \"Bacula daemon message\" %r"
  mail = root@localhost = all, !skipped
  console = all, !skipped, !saved
  append = "/var/bacula/working/log" = all, !skipped
}

Pool {
  Name = Default
  Pool Type = Backup
  Recycle = yes                       # Bacula can automatically recycle Volumes
  AutoPrune = yes                     # Prune expired volumes
  Volume Retention = 365 days         # one year
}


Pool {
Name = Full-pool
Storage = Full
Pool Type = Backup
Recycle Oldest Volume = yes
AutoPrune = yes
Volume Retention = 30 days
Maximum Volume Bytes = 50G
Label Format = Full-
Maximum Volumes = 100
}

Pool {
Name = Inc-pool
Storage = Incr
Pool Type = Backup
Recycle Oldest Volume = yes
AutoPrune = yes
Volume Retention = 7 days
Label Format = Inc-
Maximum Volume Bytes = 50G
Maximum Volumes = 100
}

Console {
  Name = backup01-mon
  Password = "xjSmm1K+l1GRVFYMSKlZXWd1a76xrhgyZciqOl/QnArY"
  CommandACL = status, .status
}

Job {
  Name = "jls_backup"
  Client = jls
  JobDefs = "DefaultJob"
  Fileset = "jls_set"
}

Client {
  Name = jls
  Address = 192.168.81.130
  FDPort = 9102
  Catalog = MyCatalog
  Password = "BhPuYwB0ytlQPAjPe086Qi/zThuG9oaJNhe+KrxoDUAO"          # password for FileDaemon
  File Retention = 30 days            # 30 days
  Job Retention = 6 months            # six months
  AutoPrune = yes                     # Prune expired Jobs/Files
}

FileSet {
  Name = "jls_set"
  Include {
    Options {
      signature = MD5
    }
    File = "/home/jls/backups"
   }
}

#############################################################################################################



bacula-sd.conf on VM Bacula server:
##############################################################################################################

Storage {                           
  Name = backup01-sd
  SDPort = 9103                 
  WorkingDirectory = "/var/bacula/working"
  Pid Directory = "/var/run"
  Maximum Concurrent Jobs = 20
}


Director {
  Name = backup01-dir
  Password = "khnz/SHLgz8TfygpeMsUuEf00NinIsDDFQOUBtJ8KCLj"
}

Director {
  Name = backup01-mon
  Password = "ltdZTDTd5+dQHXTXoyg5+6MDTKYiRFVTQbHM/yyejjHs"
  Monitor = yes
}


Device {
  Name = Full
  Media Type = File
  Archive Device = /home/backups/full
  LabelMedia = yes;                  
  Random Access = Yes;
  AutomaticMount = yes;             
  RemovableMedia = no;
  AlwaysOpen = no;
  LabelMedia = Yes;
}

Device {
  Name = Incr
  Media Type = File
  Archive Device = /home/backups/incr
  LabelMedia = yes;                 
  Random Access = Yes;
  AutomaticMount = yes;             
  RemovableMedia = no;
  LabelMedia = Yes;
  AlwaysOpen = no;
}

##########################################################################################################
#

I know client side FD is configured correctly. Here is status output:

Connecting to Client jls at 192.168.81.130:9102

linux-c3zj-fd Version: 5.0.3 (04 August 2010)  i686-pc-linux-gnu suse 11.3
Daemon started 17-Mar-11 21:11. Jobs: run=0 running=0.
 Heap: heap=135,168 smbytes=16,119 max_bytes=19,260 bufs=60 max_bufs=81
 Sizeof: boffset_t=8 size_t=4 debug=0 trace=0

Running Jobs:
Director connected at: 18-Mar-11 08:02
No Jobs running.
====

Terminated Jobs:
====


I have run the selfbackup for the VM Bacula server successfully to the "Full" storage device:

18-Mar 08:05 backup01-dir JobId 20: Start Backup JobId 20, Job=Backup01_selfbackup.2011-03-18_08.05.00_10
18-Mar 08:05 backup01-dir JobId 20: Using Device "Full"
18-Mar 08:05 backup01-sd JobId 20: Volume "Full-0001" previously written, moving to end of data.
18-Mar 08:05 backup01-sd JobId 20: Ready to append to end of Volume "Full-0001" size=23215038
18-Mar 08:05 backup01-sd JobId 20: Job write elapsed time = 00:00:01, Transfer rate = 23.18 M Bytes/second
18-Mar 08:05 backup01-dir JobId 20: Bacula backup01-dir 5.0.3 (04Aug10): 18-Mar-2011 08:05:03
  Build OS:               i686-pc-linux-gnu suse 11.3
  JobId:                  20
  Job:                    Backup01_selfbackup.2011-03-18_08.05.00_10
  Backup Level:           Full
  Client:                 "backup01-fd" 5.0.3 (04Aug10) i686-pc-linux-gnu,suse,11.3
  FileSet:                "Full Set" 2011-03-17 17:32:53
  Pool:                   "Full-pool" (From Job resource)
  Catalog:                "MyCatalog" (From Client resource)
  Storage:                "Full" (From Pool resource)
  Scheduled time:         18-Mar-2011 08:04:52
  Start time:             18-Mar-2011 08:05:02
  End time:               18-Mar-2011 08:05:03
  Elapsed time:           1 sec
  Priority:               10
  FD Files Written:       54
  SD Files Written:       54
  FD Bytes Written:       23,176,668 (23.17 MB)
  SD Bytes Written:       23,182,411 (23.18 MB)
  Rate:                   23176.7 KB/s
  Software Compression:   None
  VSS:                    no
  Encryption:             no
  Accurate:               yes
  Volume name(s):         Full-0001
  Volume Session Id:      5
  Volume Session Time:    1300391435
  Last Volume Bytes:      46,416,859 (46.41 MB)
  Non-fatal FD errors:    0
  SD Errors:              0
  FD termination status:  OK
  SD termination status:  OK
  Termination:            Backup OK



But when I try to run the VM Client the job hangs here:

Running Jobs:
Console connected at 18-Mar-11 08:06
 JobId Level   Name                       Status
======================================================================
    21 Full    jls_backup.2011-03-18_08.07.10_12 is waiting for Client jls to connect to Storage Full
====


and;

Connecting to Client jls at 192.168.81.130:9102

linux-c3zj-fd Version: 5.0.3 (04 August 2010)  i686-pc-linux-gnu suse 11.3
Daemon started 17-Mar-11 21:11. Jobs: run=0 running=0.
 Heap: heap=135,168 smbytes=19,173 max_bytes=19,260 bufs=80 max_bufs=81
 Sizeof: boffset_t=8 size_t=4 debug=0 trace=0

Running Jobs:
JobId 21 Job jls_backup.2011-03-18_08.07.10_12 is running.
    Full System or Console Job started: 18-Mar-11 08:07
    Files=0 Bytes=0 Bytes/sec=0 Errors=0
    Files Examined=0
    SDSocket closed.
Director connected at: 18-Mar-11 08:08
====




and eventually terminates in error:

18-Mar 05:31 linux-c3zj-fd JobId 18: Fatal error: bsock.c:134 Unable to connect to Storage daemon on localhost:9103. ERR=Connection refused
18-Mar 05:31 linux-c3zj-fd JobId 18: Fatal error: Failed to connect to Storage daemon: localhost:9103
18-Mar 05:31 backup01-dir JobId 18: Fatal error: Bad response to Storage command: wanted 2000 OK storage
, got 2902 Bad storage

18-Mar 05:31 backup01-dir JobId 18: Error: Bacula backup01-dir 5.0.3 (04Aug10): 18-Mar-2011 05:31:34
  Build OS:               i686-pc-linux-gnu suse 11.3
  JobId:                  18
  Job:                    jls_backup.2011-03-17_21.53.00_05
  Backup Level:           Full (upgraded from Incremental)
  Client:                 "jls" 5.0.3 (04Aug10) i686-pc-linux-gnu,suse,11.3
  FileSet:                "jls_set" 2011-03-17 19:24:54
  Pool:                   "Full-pool" (From Job resource)
  Catalog:                "MyCatalog" (From Client resource)
  Storage:                "Full" (From Pool resource)
  Scheduled time:         17-Mar-2011 21:53:00
  Start time:             17-Mar-2011 21:53:04
  End time:               18-Mar-2011 05:31:34
  Elapsed time:           7 hours 38 mins 30 secs
  Priority:               10
  FD Files Written:       0
  SD Files Written:       0
  FD Bytes Written:       0 (0 B)
  SD Bytes Written:       0 (0 B)
  Rate:                   0.0 KB/s
  Software Compression:   None
  VSS:                    no
  Encryption:             no
  Accurate:               yes
  Volume name(s):
  Volume Session Id:      3
  Volume Session Time:    1300391435
  Last Volume Bytes:      23,214,420 (23.21 MB)
  Non-fatal FD errors:    0
  SD Errors:              0
  FD termination status:  Error
  SD termination status:  Waiting on FD
  Termination:            *** Backup Error ***



I can telnet to port 9103 from the client side and the connection is accepted:

linux-c3zj:~ # telnet 192.168.81.129 9103
Trying 192.168.81.129...
Connected to 192.168.81.129.
Escape character is '^]'.




OK so it seems the network layer is sorted but I found an earlier post talking about acket sizes errors - I once saw an error like that in the log but it has gone away which leads me to think maybe the VMware virtual network layer is incompattible?

Or am i missing something really obvious here?

Thanks in advance

Lawrence









------------------------------------------------------------------------------
Colocation vs. Managed Hosting
A question and answer guide to determining the best fit
for your organization - today and in the future.
http://p.sf.net/sfu/internap-sfd2d
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users
<Prev in Thread] Current Thread [Next in Thread>