Bacula-users

Re: [Bacula-users] Problems with De-Duplication

2015-08-19 11:41:38
Subject: Re: [Bacula-users] Problems with De-Duplication
From: "E.L.L. Assoua" <ellassoua AT gmail DOT com>
To: Heitor Faria <heitor AT bacula.com DOT br>
Date: Wed, 19 Aug 2015 17:36:31 +0200
Works!
Thanks Heitor!

Best regards,

Luc Assoua
Wednesday, August 19, 2015 3:41 PM via Postbox
Could you check out my configuration?
I do not normally do this, since it is consultancy work and you probably won't create new synapses this way. =)
Anyway, here it goes:
I've configured De-duplication on a Bacula 7.0.5.9 - CentOS 7 ( client-fd is version 5.2 - Windows 8.1 Pro x64 ) with the following properties:
A specific Pool for the base volumes:

Pool {
  Name = BaseJobs
  Pool Type = Backup
  Label Format = "${Level}-${Job}-${Day}-${Month}-${Year}-${Hour:p/2/0/r}${Minute:p/2/0/r}${Second:p/2/0/r}"
  Use Volume Once = yes
}

The specific FileSet for the base job:

FileSet {
  Name = "BaseFileSet_Windows8.1"
  Enable VSS = yes
  Include {
    Options {
The FileSet options must also contain:

      BaseJob  = pmugcs5
      Accurate = mcs
      Verify   = pin5
      signature = MD5
      compression = GZIP
      }
    File = C:/Windows/
  }
}
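
For reference, the option letters in the three added directives can be read as follows. This is an annotated sketch based on the FileSet option tables in the Bacula manual; the comments are not part of the original message, and the letter meanings should be checked against the manual for your version:

    Options {
      # Attributes compared against the Base job when running a Full:
      # p = permission bits, m = modification time, u = user id, g = group id,
      # c = change time, s = size, 5 = MD5 signature
      BaseJob  = pmugcs5
      # Attributes compared by Accurate jobs: m = mtime, c = ctime, s = size
      Accurate = mcs
      # Attributes compared by Verify jobs:
      # p = permission bits, i = inode, n = number of links, 5 = MD5 signature
      Verify   = pin5
      # The "5" letter above needs an MD5 signature stored for each file
      signature = MD5
      compression = GZIP
    }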

The base job:

Job {
  Name = "BaseJob_Windows8.1"
  Type = Backup
  Level = Base
  Client = tom-helpdesk-fd
  FileSet = BaseFileSet_Windows8.1
  Pool = BaseJobs
  Storage = ns-bacula-sd
  Messages = Standard
}

The JobDefs for the full backup job:

JobDefs {
  Name = "Windows_Backup"
  Type = Backup
  Level = Full
  FileSet = "Windows FileSet"
  Messages = Standard
  SpoolAttributes = yes
  Priority = 10
  Write Bootstrap = "/var/spool/bacula/%c.bsr"
  Storage = ns-bacula-sd
}

The full backup job:

Job {
  Name = "FrankPC-Backup"
  JobDefs = "Windows_Backup"
You must have this directive here:
  Accurate = yes
  Base = "BaseJob_Windows8.1"
For the magic to happen, you must list both the job itself and the base job, separated by commas:

  Base = "FrankPC-Backup", "BaseJob_Windows8.1"
  Client = tom-helpdesk-fd
  Schedule = "FrankPC_Backup"
  Pool = RegularBackups
}
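
Putting the two corrections above together, the full backup job would presumably end up as follows. This is a sketch assembled from the fragments in this message, not a tested configuration; it assumes the original single-name Base line is replaced rather than kept alongside the new one:

Job {
  Name = "FrankPC-Backup"
  JobDefs = "Windows_Backup"
  # Required so the job can be compared against the Base job
  Accurate = yes
  # The job itself first, then the Base job it is compared with
  Base = "FrankPC-Backup", "BaseJob_Windows8.1"
  Client = tom-helpdesk-fd
  Schedule = "FrankPC_Backup"
  Pool = RegularBackups
}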

The client

Client {
  Name = tom-helpdesk-fd
  Address = 10.13.37.103
  FDPort = 9102
  Catalog = MyCatalog
  Password = "~XXXXXXX"          # password for FileDaemon
  File Retention = 60 days            # 60 days
  Job Retention = 6 months            # six months
  AutoPrune = yes                     # Prune expired Jobs/Files

}

Best regards,

Luc Assoua

Wednesday, August 19, 2015 3:24 PM via Postbox
Dear Heitor,

Could you check out my configuration?

I've configured De-duplication on a Bacula 7.0.5.9 - CentOS 7 ( client-fd is version 5.2 - Windows 8.1 Pro x64 ) with the following properties:

A specific Pool for the base volumes:

Pool {
  Name = BaseJobs
  Pool Type = Backup
  Label Format = "${Level}-${Job}-${Day}-${Month}-${Year}-${Hour:p/2/0/r}${Minute:p/2/0/r}${Second:p/2/0/r}"
  Use Volume Once = yes
}

The specific FileSet for the base job:

FileSet {
  Name = "BaseFileSet_Windows8.1"
  Enable VSS = yes
  Include {
    Options {
      signature = MD5
      compression = GZIP
      }
    File = C:/Windows/
  }
}

The base job:

Job {
  Name = "BaseJob_Windows8.1"
  Type = Backup
  Level = Base
  Client = tom-helpdesk-fd
  FileSet = BaseFileSet_Windows8.1
  Pool = BaseJobs
  Storage = ns-bacula-sd
  Messages = Standard
}

The JobDefs for the full backup job:

JobDefs {
  Name = "Windows_Backup"
  Type = Backup
  Level = Full
  FileSet = "Windows FileSet"
  Messages = Standard
  SpoolAttributes = yes
  Priority = 10
  Write Bootstrap = "/var/spool/bacula/%c.bsr"
  Storage = ns-bacula-sd
}

The full backup job:

Job {
  Name = "FrankPC-Backup"
  JobDefs = "Windows_Backup"
  Base = "BaseJob_Windows8.1"
  Client = tom-helpdesk-fd
  Schedule = "FrankPC_Backup"
  Pool = RegularBackups
}

The client

Client {
  Name = tom-helpdesk-fd
  Address = 10.13.37.103
  FDPort = 9102
  Catalog = MyCatalog
  Password = "~XXXXXXX"          # password for FileDaemon
  File Retention = 60 days            # 60 days
  Job Retention = 6 months            # six months
  AutoPrune = yes                     # Prune expired Jobs/Files

}

Best regards,

Luc Assoua
Wednesday, August 19, 2015 3:14 PM via Postbox
Hello guys,

1. Bacula Enterprise has Global deduplication, which is more automated and efficient than the Community dedup. It goes beyond file granularity and works at block level on the storage: http://www.baculasystems.com/wp-content/uploads/bacula-enterprise-v8-global-endpoint-deduplication2.pdf 
2. If you ran a base backup job using Bacula Community and correctly set the original Backup job to compare against the Base job, you will probably still back up part of C:/Windows again because, of course, there are files under it that are always changing. But it is only expected to copy the files that changed relative to the ones copied by the Base Job.

Regards,
===========================================================================
Heitor Medrado de Faria - LPIC-III | ITIL-F |  Bacula Systems Certified Administrator II
I do Bacula training and deploy in any city of the world. More information: http://bacula.us/
===========================================================================



Wednesday, August 19, 2015 2:54 PM via Postbox
Deduplication is not an option???
I thought it was a plugin that was available for purchase....

Wednesday, August 19, 2015 2:38 PM via Postbox
Nobody uses the de-duplication option?
