Leveraging 3D PCRAM technologies to reduce checkpoint overhead for future exascale systems

@article{Dong2009Leveraging3P,
  title={Leveraging 3D PCRAM technologies to reduce checkpoint overhead for future exascale systems},
  author={Xiangyu Dong and Naveen Muralimanohar and Norman P. Jouppi and Richard Kaufmann and Yuan Xie},
  journal={Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis},
  year={2009},
  pages={1-12}
}
The scalability of future massively parallel processing (MPP) systems is challenged by high failure rates. Current hard disk drive (HDD) checkpointing results in overhead of 25% or more at the petascale. With a direct correlation between checkpoint frequencies and node counts, novel techniques that can take more frequent checkpoints with minimum overhead… CONTINUE READING