BackupPC-users

Re: [BackupPC-users] improving the deduplication ratio

2008-04-14 19:30:15
Subject: Re: [BackupPC-users] improving the deduplication ratio
From: Les Mikesell <lesmikesell AT gmail DOT com>
To: Ludovic Drolez <ldrolez AT debian DOT org>
Date: Mon, 14 Apr 2008 18:30:06 -0500
Ludovic Drolez wrote:
> On Wed, Apr 09, 2008 at 10:12:09AM -0500, Les Mikesell wrote:
>> I'd probably look at what rdiff-backup does with incremental differences 
>> and instead of chunking everything, just track changes where the 
>> differences are small.
> 
> Yes but rdiff-backup has no pooling/deduplication.

You get the same effect within a single host.  That is you can restore 
states from multiple times without keeping full copies of each.

> With that feature, backuppc would be closer to rdiff-backup with
> pooling on top of that.

Yes, I wouldn't expect many random matches from chunked files except in 
the special cases of growing logfiles or small changes to large 
databases.  If the rsync process built the difference files like 
rdiff-backup and then pooled them where it would save space compared to 
a new copy it might be a big win.  But, it would have to be just for 
incrementals or it would be tricky to keep track of dependencies when 
expiring earlier runs.

-- 
   Les Mikesell
    lesmikesell AT gmail DOT com

-------------------------------------------------------------------------
This SF.net email is sponsored by the 2008 JavaOne(SM) Conference 
Don't miss this year's exciting event. There's still time to save $100. 
Use priority code J8TL2D2. 
http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone
_______________________________________________
BackupPC-users mailing list
BackupPC-users AT lists.sourceforge DOT net
List:    https://lists.sourceforge.net/lists/listinfo/backuppc-users
Wiki:    http://backuppc.wiki.sourceforge.net
Project: http://backuppc.sourceforge.net/