A Logistic Based Mathematical Model to Optimize Duplicate Elimination Ratio in Content Defined Chunking Based Big Data Storage System

Deduplication is an efficient data reduction technique, and it is used to mitigate the problem of huge data volume in big data storage systems. Content defined chunking (CDC) is the most widely used algorithm in deduplication systems. The expected chunk size is an important parameter of CDC, and it...

Full description

Bibliographic Details
Main Authors: Longxiang Wang, Xiaoshe Dong, Xingjun Zhang, Fuliang Guo, Yinfeng Wang, Weifeng Gong
Format: Article
Language:English
Published: MDPI AG 2016-07-01
Series:Symmetry
Subjects:
Online Access:http://www.mdpi.com/2073-8994/8/7/69