Nicke
ADSM.ORG Senior Member
Almost half the stgpool size with de-dup in 6.1.3.2
I think the native TSM 6.x de-dup works pretty well.
One example is this VTL stgpool, where de-dup saves about 4.4 TB of space:
Storage Pool Name: ACTIVE_VTL_DD
Storage Pool Type: Primary
Device Class Name: VTLDD
Estimated Capacity: 7,682 G
Space Trigger Util: 94.0
Pct Util: 62.0
Pct Migr: 62.0
Pct Logical: 79.0
High Mig Pct: 90
Low Mig Pct: 70
Migration Delay: 0
Migration Continue: Yes
Migration Processes: 1
Reclamation Processes: 6
Next Storage Pool: ACTIVELTO
Reclaim Storage Pool:
Maximum Size Threshold: No Limit
Access: Read/Write
Description: Dedup-pool
Overflow Location:
Cache Migrated Files?:
Collocate?: Group
Reclamation Threshold: 60
Offsite Reclamation Limit:
Maximum Scratch Volumes Allowed: 760
Number of Scratch Volumes Used: 505
Delay Period for Volume Reuse: 0 Day(s)
Migration in Progress?: No
Amount Migrated (MB): 1,483,681.47
Elapsed Migration Time (seconds): 63,407
Reclamation in Progress?: No
Last Update by (administrator): XXXXX
Last Update Date/Time: 05/21/2010 12:05:12
Storage Pool Data Format: Native
Copy Storage Pool(s):
Active Data Pool(s):
Continue Copy on Error?: Yes
CRC Data: No
Reclamation Type: Threshold
Overwrite Data when Deleted:
Deduplicate Data?: Yes
Processes For Identifying Duplicates: 2
Duplicate Data Not Stored: 4 400 G (49%)
...
We run only 2 de-dup identify processes, and we monitor the reclamation processes that free up volumes.
Process  Process Description  Status
Number
-------- -------------------- -------------------------------------------------
206      Identify Duplicates  Storage pool: ACTIVE_VTL_DD. Volume: NONE. State:
                              idle. State Date/Time: 2010-06-24 08:26:52.
                              Current Physical File(bytes): 0. Total Files
                              Processed: 3388025. Total Duplicate Extents
                              Found: 27 705 973. Total Duplicate Bytes Found:
                              4 785 235 436 185.
207      Identify Duplicates  Storage pool: ACTIVE_VTL_DD. Volume: NONE. State:
                              idle. State Date/Time: 2010-06-24 08:20:17.
                              Current Physical File(bytes): 0. Total Files
                              Processed: 598506. Total Duplicate Extents
                              Found: 7 594 202. Total Duplicate Bytes Found:
                              1 043 675 163 147.
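To put the two "Total Duplicate Bytes Found" counters into a more readable unit, here is a small sketch that adds them up. The helper name `parse_grouped` is just something I made up for this post, and I am assuming decimal terabytes (10^12 bytes). Note that these are cumulative counters from the identify processes, so the sum need not equal the pool's "Duplicate Data Not Stored" figure.

```python
def parse_grouped(text: str) -> int:
    """Turn a space-grouped number such as '4 785 235 436 185' into an int."""
    return int(text.replace(" ", ""))


# Values copied from the two Identify Duplicates processes (206 and 207).
duplicate_bytes = [
    parse_grouped("4 785 235 436 185"),
    parse_grouped("1 043 675 163 147"),
]

total_bytes = sum(duplicate_bytes)
total_tb = total_bytes / 10**12  # assumption: decimal TB, not TiB

print(f"Duplicate bytes found by both processes: {total_bytes}")
print(f"That is roughly {total_tb:.2f} TB")
```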
...
So we are happy with the TSM 6 de-dup.
...
With IBM ProtecTIER or EMC Data Domain it may be possible to get a 3-6 times (300-600 %) de-dup ratio, but that requires a lengthy and costly installation project. TSM has also always had a very good native VTL function, so you don't need an external VTL product.
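Since TSM reports a percentage saved (49%) while the appliance figures are usually quoted as a ratio (3-6 times), here is a rough sketch for converting between the two so the numbers can be compared. It assumes "ratio" means data before de-dup divided by data actually stored, and the function names are only for illustration.

```python
def ratio_from_saved(pct_saved: float) -> float:
    """De-dup ratio (original / stored) for a given percentage of space saved."""
    return 100.0 / (100.0 - pct_saved)


def saved_from_ratio(ratio: float) -> float:
    """Percentage of space saved for a given de-dup ratio (original / stored)."""
    return 100.0 * (1.0 - 1.0 / ratio)


# 49% is the "Duplicate Data Not Stored" percentage from the stgpool output.
print(f"49% saved is about a {ratio_from_saved(49.0):.2f}:1 ratio")

# 3x and 6x are the ratios mentioned for the external products.
for ratio in (3.0, 6.0):
    print(f"{ratio:.0f}:1 ratio is about {saved_from_ratio(ratio):.1f}% saved")
```

Under that assumption, our 49% is close to 2:1, while 3-6 times would mean roughly 67-83% saved.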
...
Kind Regards,
Nicke