Building Reliable High-Performance Storage Systems: An Empirical and Analytical Study

@inproceedings{Qiao2019BuildingRH,
  title={Building Reliable High-Performance Storage Systems: An Empirical and Analytical Study},
  author={Z. Qiao and S. Fu and Hsing-Bung Chen and B. Settlemyer},
  booktitle={2019 IEEE International Conference on Cluster Computing (CLUSTER)},
  year={2019},
  pages={1--10}
}
  • Z. Qiao, S. Fu, Hsing-Bung Chen, B. Settlemyer
  • Published 2019
  • Computer Science
  • 2019 IEEE International Conference on Cluster Computing (CLUSTER)
  • Due to the vast storage needs of high-performance computing (HPC), the scale and complexity of storage systems in HPC data centers continue to grow. Disk failures have become the norm. With ever-increasing disk capacity, RAID recovery based on disk rebuild becomes more and more expensive, causing significant performance degradation and even unavailability of storage systems. Declustered redundant arrays of independent disks (RAID) shuffle data and parity blocks among all drives in a RAID group…
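
The declustering idea in the abstract, spreading each stripe's data and parity blocks over all drives, can be illustrated with a small sketch. This is not the layout studied in the paper: the pseudo-random placement rule, the function names, and the pool parameters below are all assumptions chosen for illustration. It only shows why rebuild reads after a single disk failure tend to be shared by every surviving drive instead of a few fixed peers.

```python
import random
from collections import Counter


def declustered_layout(n_disks, stripe_width, n_stripes, seed=0):
    """Place each stripe on `stripe_width` distinct disks drawn
    pseudo-randomly from the whole pool (hypothetical placement rule)."""
    rng = random.Random(seed)
    return [rng.sample(range(n_disks), stripe_width) for _ in range(n_stripes)]


def rebuild_read_load(layout, failed_disk):
    """Count how many blocks each surviving disk must supply
    to reconstruct the blocks lost on `failed_disk`."""
    load = Counter()
    for stripe in layout:
        if failed_disk in stripe:
            for disk in stripe:
                if disk != failed_disk:
                    load[disk] += 1
    return load


# Illustrative parameters only: 20 drives, 5-block stripes, 2000 stripes.
layout = declustered_layout(n_disks=20, stripe_width=5, n_stripes=2000)
load = rebuild_read_load(layout, failed_disk=0)
print(len(load), "surviving disks share the rebuild reads")
```

In a conventional RAID group of the same stripe width, only the four peers of the failed drive would serve rebuild reads; here the same work is divided among the rest of the pool, which is the property the abstract attributes to declustering.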
    1 Citation
    A Smart Background Scheduler for Storage Systems
    • Maher Kachmar, D. Kaeli
    • Computer Science
    • 2020 28th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS)
    • 2020
