ADSM-L

Re: [ADSM-L] Deduplication

2017-04-10 12:18:39
From: Del Hoobler <hoobler AT US.IBM DOT COM>
To: ADSM-L AT VM.MARIST DOT EDU
Date: Mon, 10 Apr 2017 12:11:48 -0400
Hi Eric,

A few things:

- Client-side deduplication gives better overall throughput for Spectrum 
Protect because the deduplication work is spread across more CPUs. So if 
you can afford to do the deduplication client-side, that gives the best 
overall result.

- Client-side deduplication also reduces network traffic, because chunks 
the server already holds are not sent again.

- The deduplication algorithms are the same on the client and the 
server.
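
To illustrate the network-traffic point, here is a minimal Python sketch of the general idea behind client-side deduplication. It is a toy model, not how Spectrum Protect is implemented: the fixed 4-byte chunk size, the use of SHA-256, and the names ToyServer, missing, put and client_side_backup are all assumptions made up for this example.

```python
import hashlib

CHUNK_SIZE = 4  # tiny fixed-size chunks, purely for illustration (assumption)


def chunks(data: bytes, size: int = CHUNK_SIZE):
    """Split data into fixed-size chunks (the last one may be shorter)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


class ToyServer:
    """Hypothetical server that stores chunks keyed by their SHA-256 digest."""

    def __init__(self):
        self.store = {}

    def missing(self, digests):
        """Return the digests the server does not hold yet."""
        return {d for d in digests if d not in self.store}

    def put(self, digest, chunk):
        self.store[digest] = chunk


def client_side_backup(server: ToyServer, data: bytes) -> int:
    """Hash on the client, send only unknown chunks, return bytes sent."""
    by_digest = {hashlib.sha256(c).hexdigest(): c for c in chunks(data)}
    sent = 0
    for digest in server.missing(by_digest):
        server.put(digest, by_digest[digest])
        sent += len(by_digest[digest])
    return sent


server = ToyServer()
first = client_side_backup(server, b"AAAABBBBCCCC")   # all three chunks are new
second = client_side_backup(server, b"AAAABBBBDDDD")  # only "DDDD" is new
print(first, second)
```

In this toy model the hashing cost lands on the client, and the second backup sends only the one chunk the server has not seen before.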


The behavior you are seeing comes from the effect of reusedelay on 
deduplicated chunks. If the reusedelay is 1 day (the default), Spectrum 
Protect keeps the deduplicated chunks pinned in storage until that time 
has passed. If the reusedelay is 0, there is still a small cushion window 
during which the chunks can still be linked to. If you wait a couple of 
hours AFTER the deletion occurs, I would not expect those chunks to be 
reused.
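
The pinning behavior can be pictured with a small Python sketch. This is only a toy model under stated assumptions, not the server's actual logic: the ToyPool class, its method names, and the one-hour cushion are hypothetical (the real size of the cushion window is not given here, only that a couple of hours should be enough).

```python
from datetime import datetime, timedelta


class ToyPool:
    """Hypothetical pool that tracks only chunks released by a deletion.

    A released chunk stays linkable until the reuse delay plus a small
    cushion has elapsed.
    """

    def __init__(self, reuse_delay_days: int, cushion: timedelta):
        self.reuse_delay = timedelta(days=reuse_delay_days)
        self.cushion = cushion  # assumed value, the real window is not stated
        self.released_at = {}   # chunk id -> time its last reference was deleted

    def release(self, chunk_id: str, now: datetime) -> None:
        """Record that the last backup referencing this chunk was deleted."""
        self.released_at[chunk_id] = now

    def can_link(self, chunk_id: str, now: datetime) -> bool:
        """True if a new backup could still deduplicate against this chunk."""
        released = self.released_at.get(chunk_id)
        if released is None:
            return False  # this toy model knows nothing about the chunk
        return now < released + self.reuse_delay + self.cushion


deleted = datetime(2017, 4, 10, 10, 0)

pool0 = ToyPool(reuse_delay_days=0, cushion=timedelta(hours=1))
pool0.release("chunk-1", deleted)
soon_after = pool0.can_link("chunk-1", deleted + timedelta(minutes=5))
hours_later = pool0.can_link("chunk-1", deleted + timedelta(hours=3))

pool1 = ToyPool(reuse_delay_days=1, cushion=timedelta(hours=1))
pool1.release("chunk-1", deleted)
same_day = pool1.can_link("chunk-1", deleted + timedelta(hours=3))
two_days = pool1.can_link("chunk-1", deleted + timedelta(days=2))

print(soon_after, hours_later, same_day, two_days)
```

With a reuse delay of 0 in this model, a backup started minutes after the deletion can still link to the old chunks, while one started a few hours later cannot.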



Del

----------------------------------------------------

"ADSM: Dist Stor Manager" <ADSM-L AT VM.MARIST DOT EDU> wrote on 04/10/2017 
10:57:27 AM:

> From: "Loon, Eric van (ITOPT3) - KLM" <Eric-van.Loon AT KLM DOT COM>
> To: ADSM-L AT VM.MARIST DOT EDU
> Date: 04/10/2017 11:01 AM
> Subject: Deduplication
> Sent by: "ADSM: Dist Stor Manager" <ADSM-L AT VM.MARIST DOT EDU>
> 
> Hi guys!
> We are trying to make a fair comparison between server- and client-side 
> deduplication. I'm running into an 'issue' where I notice that 
> once you have created a backup of a certain set of data, it is always 
> deduplicated 100% afterwards when you start a new client-side 
> deduped backup. Even when you delete all previous backups on the server 
> first!
> So I backed up a directory, retrieved all objectids through a select
> * from backups and deleted all objects, but still a new backup is 
> deduplicated 100%. I don't understand why. I thought it maybe had 
> something to do with data still being in the container pool, but 
> even with reusedelay=0, everything is deduplicated...
> Thanks for any help (Andy? :)) in advance.
> Kind regards,
> Eric van Loon
> Air France/KLM Storage Engineering
> 
