Corpus ID: 17473289

Managing and Monitoring High-Performance Computing Clusters with IPMI

@inproceedings{FangManagingAM,
  title={Managing and Monitoring High-Performance Computing Clusters with IPMI},
  author={Yung-Chin Fang and Garima Kochhar and Randy Deroeck}
}
High-performance computing (HPC) clusters are widely used for compute-intensive, transaction-intensive, and I/O-intensive applications. The benefits that enterprises can derive from standards-based HPC clusters compared to large symmetric multiprocessing (SMP)–based supercomputers are well known, including scalability, ease of technology refresh, reusability of components, and disaster recovery capabilities.¹ However, cluster mean time between failures (MTBF) is inversely proportional to the…
