Data deduplication

Known as: Deduplication, De-duplication, Duplication 
In computing, data deduplication is a specialized data compression technique for eliminating duplicate copies of repeating data. Related and somewhat… (More)
Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2014
Highly Cited
2014
Data deduplication is a technique for eliminating duplicate copies of data, and has been widely used in cloud storage to reduce… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 5
  • figure 4
Is this relevant?
Highly Cited
2014
Highly Cited
2014
Nowadays, more and more corporate and private users outsource their data to cloud storage providers. At the same time, recent… (More)
  • figure 1
  • figure 2
  • figure 4
Is this relevant?
Highly Cited
2012
Highly Cited
2012
In this paper, a new notion which we call private data deduplication protocol, a deduplication technique for private data storage… (More)
Is this relevant?
Highly Cited
2011
Highly Cited
2011
As data have been growing rapidly in data centers, deduplication storage systems continuously face challenges in providing the… (More)
  • figure 1
  • figure 2
  • table 1
  • figure 3
  • table 2
Is this relevant?
Highly Cited
2010
Highly Cited
2010
As one of the key characteristics of virtualization, live virtual machine (VM) migration provides great benefits for load… (More)
  • table II
  • table III
  • figure 1
  • figure 2
  • figure 3
Is this relevant?
Highly Cited
2009
Highly Cited
2009
© Sparse Indexing: Large Scale, Inline Deduplication Using Sampling and Locality Mark Lillibridge, Kave Eshghi, Deepavali Bhagwat… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Is this relevant?
Highly Cited
2008
Highly Cited
2008
As the world moves to digital storage for archival purposes, there is an increasing demand for systems that can provide secure… (More)
  • figure 1
  • table 1
  • figure 2
  • figure 3
  • table 2
Is this relevant?
Highly Cited
2008
Highly Cited
2008
Disk-based deduplication storage has emerged as the new-generation storage system for enterprise data protection to replace tape… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Is this relevant?
Highly Cited
2008
Highly Cited
2008
Effectiveness and tradeoffs of deduplication technologies are not well understood -- vendors tout Deduplication as a "silver… (More)
  • figure 1
  • table 1
  • table 3
  • table 2
  • figure 2
Is this relevant?
Highly Cited
2002
Highly Cited
2002
Deduplication is a key operation in integrating data from multiple sources. The main challenge in this task is designing a… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 7
Is this relevant?