Energy profile of rollback-recovery strategies in high performance computing

@article{Meneses2014EnergyPO,
  title={Energy profile of rollback-recovery strategies in high performance computing},
  author={Esteban Meneses and Osman Sarood and Laxmikant V. Kal{\'e}},
  journal={Parallel Computing},
  year={2014},
  volume={40},
  pages={536-547}
}
Extreme-scale computing is set to provide the infrastructure for the advances and breakthroughs that will solve some of the hardest problems in science and engineering. However, resilience and energy concerns loom as two of the major challenges for machines at that scale. The number of components that will be assembled in the supercomputers plays a fundamental role in these challenges. First, a large number of parts will substantially increase the failure rate of the system compared to the… CONTINUE READING