A Fault-Tolerant Framework for Asynchronous Iterative Computations in Cloud Environments

@article{Wang2016AFF,
  title={A Fault-Tolerant Framework for Asynchronous Iterative Computations in Cloud Environments},
  author={Zhigang Wang and Lixin Gao and Yu Gu and Yubin Bao and Ge Yu},
  journal={IEEE Transactions on Parallel and Distributed Systems},
  year={2018},
  volume={29},
  pages={1678--1692}
}
Many graph algorithms are iterative and can be run synchronously on distributed memory-based systems. An asynchronous model has recently been proposed to accelerate such iterative computations. Recovering from failures in an asynchronous system is challenging, however, because a typical checkpointing-based approach requires many expensive synchronization barriers that largely offset the gains of asynchronous computation. This paper first proposes a fault-tolerant…
4 Citations
5 References