A Logistic Based Mathematical Model to Optimize Duplicate Elimination Ratio in Content Defined Chunking Based Big Data Storage System
Deduplication is an efficient data reduction technique, and it is used to mitigate the problem of huge data volume in big data storage systems. Content defined chunking (CDC) is the most widely used algorithm in deduplication systems. The expected chunk size is an important parameter of CDC, and it...
Main Authors: | Longxiang Wang, Xiaoshe Dong, Xingjun Zhang, Fuliang Guo, Yinfeng Wang, Weifeng Gong |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2016-07-01
|
Series: | Symmetry |
Subjects: | |
Online Access: | http://www.mdpi.com/2073-8994/8/7/69 |
Similar Items
-
Data Deduplication System Based on Content-Defined Chunking Using Bytes Pair Frequency Occurrence
by: Ahmed Sardar M. Saeed, et al.
Published: (2020-11-01) -
Double Sliding Window Chunking Algorithm for Data Deduplication in Ocean Observation
by: Shuai Guo, et al.
Published: (2023-01-01) -
A Design of Parallel Content-Defined Chunking System Using Non-Hashing Algorithms on FPGA
by: Hung Vuong, et al.
Published: (2022-01-01) -
Decomposing a Chunk into Its Elements and Reorganizing Them As a New Chunk: The Two Different Sub-processes Underlying Insightful Chunk Decomposition
by: Xiaofei Wu, et al.
Published: (2017-11-01) -
Lightweight hash-based de-duplication system using the self detection of most repeated patterns as chunks divisors
by: Saja Taha Ahmed, et al.
Published: (2022-07-01)