Probabilistic deduplication for cluster-based storage systems

Davide Frey, Anne-Marie Kermarrec, Konstantinos Kloudas
The need to back up huge quantities of data has led to the development of a number of distributed deduplication techniques that aim to reproduce the operation of centralized, single-node backup systems in a cluster-based environment. At one extreme, stateful solutions rely on indexing mechanisms to maximize deduplication. However, the cost of these strategies in terms of computation and memory resources makes them unsuitable for large-scale storage systems. At the other extreme, stateless…
This paper has 26 citations.




