Disk Failures in the Real World: What Does an MTTF of 1, 000, 000 Hours Mean to You?

  title={Disk Failures in the Real World: What Does an MTTF of 1, 000, 000 Hours Mean to You?},
  author={Bianca Schroeder and Garth A. Gibson},
Component failure in large-scale IT installations is becoming an ever larger problem as the number of components in a single cluster approaches a million. In this paper, we present and analyze field-gathered disk replacement data from a number of large production systems, including high-performance computing sites and internet services sites. About 100,000 disks are covered by this data, some for an entire lifetime of five years. The data include drives with SCSI and FC, as well as SATA… CONTINUE READING
Highly Cited
This paper has 76 citations. REVIEW CITATIONS
51 Citations
37 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 51 extracted citations

77 Citations

Citations per Year
Semantic Scholar estimates that this publication has 77 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 37 references

Specifying reliability in the disk drive industry: No more MTBFs

  • J. G. Elerath
  • In Proc. of the Annual Reliability and…
  • 2000
Highly Influential
5 Excerpts

A comprehensive review of hard-disk drive reliability

  • J. Yang, F.-B. Sun
  • In Proc. of the Annual Reliability and…
  • 1999
Highly Influential
6 Excerpts

Disk failure investigations at the internet archive

  • T. Schwarz, M. Baker, +6 authors M. Shah
  • In Work-in-Progess session, NASA/IEEE Conference…
  • 2006
2 Excerpts

Arpaci - Dusseau . Iron file system

  • Vijayan Prabhakaran, Lakshmi N. Bairavasundaram, Nitin Agrawal, Haryadi S. Gunawi, Andrea C. Arpaci-Dusseau, H. Remzi
  • SOSP ’ 05 : Proceedings of the twentieth ACM…
  • 2005

Similar Papers

Loading similar papers…