DeDu: Building a deduplication storage system over cloud computing

Zhe Sun, Jun Shen, Jianming Yong
Proceedings of the 2011 15th International Conference on Computer Supported Cooperative Work in Design (CSCWD)
This paper presents a deduplication storage system over cloud computing. Our deduplication storage system consists of two major components: a front-end deduplication application and the Hadoop Distributed File System (HDFS). HDFS is a common back-end distributed file system, which is used together with a Hadoop database. We use HDFS to build a mass storage system and a Hadoop database to build a fast indexing system. With the deduplication applications…
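The two-component design described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `DedupStore`, the use of in-memory dictionaries in place of the Hadoop Distributed File System and the Hadoop database, the choice of SHA-256 as the fingerprint, and file-level (rather than chunk-level) granularity are all assumptions made for illustration.

```python
import hashlib


class DedupStore:
    """Toy file-level deduplication store (hypothetical, for illustration).

    `blobs` stands in for the mass storage layer (HDFS in the paper) and
    `index` stands in for the fast indexing layer (the Hadoop database).
    Both are plain dicts here, which is an assumption of this sketch.
    """

    def __init__(self):
        self.blobs = {}   # fingerprint -> content, stored only once
        self.index = {}   # logical file name -> fingerprint

    @staticmethod
    def fingerprint(data: bytes) -> str:
        # SHA-256 is an assumed hash choice, not taken from the paper.
        return hashlib.sha256(data).hexdigest()

    def put(self, name: str, data: bytes) -> bool:
        """Store data under name; return True only if new content was written."""
        fp = self.fingerprint(data)
        is_new = fp not in self.blobs
        if is_new:
            self.blobs[fp] = data      # first copy: write to storage
        self.index[name] = fp          # every upload gets an index entry
        return is_new

    def get(self, name: str) -> bytes:
        """Resolve the name through the index, then read from storage."""
        return self.blobs[self.index[name]]
```

In this sketch, uploading the same bytes under two different names adds two index entries but only one stored copy, which is the space saving a deduplication front end aims for.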
This paper has 21 citations.


Publications citing this paper.

A Proposal for Improving Data Deduplication with Dual Side Fixed Size Chunking Algorithm

2013 Third International Conference on Advances in Computing and Communications • 2013

GDedup: Distributed File System Level Deduplication for Genomic Big Data

2018 IEEE International Congress on Big Data (BigData Congress) • 2018

Cloud Based Deduplication on Encrypted Data

Ankush R. Deshmukh, Prof. R. V. Mante, Dr. P. N. Chatur

Secure data deduplication mechanism based on Rabin CDC and MD5 in cloud computing environment

2017 2nd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT) • 2017

A Survey: Deduplication Ontologies

Sulakshana S. Patange


Publications referenced by this paper.

MAD2: A scalable high-throughput exact deduplication approach for network backup services

2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST) • 2010

Extreme Binning: Scalable, parallel deduplication for chunk-based file backup

2009 IEEE International Symposium on Modeling, Analysis & Simulation of Computer and Telecommunication Systems • 2009

HYDRAstor: a Scalable Secondary Storage

D. Cezary, G. Leszek, +6 authors W. Michal
Proceedings of the 7th Conference on File and Storage Technologies, San Francisco, California, 2009, pp. 197-210. • 2009

The Diverse and Exploding Digital Universe

J. F. Gantz, C. Chute, +4 authors A. Toncheva
URL: lanalyst­ reports/diverse-exploding-digital-universe.pdf • 2008
