Staggered Consistent Checkpointing

  title={Staggered Consistent Checkpointing},
  author={Nitin H. Vaidya},
  journal={IEEE Trans. Parallel Distrib. Syst.},
ÐA consistent checkpointing algorithm saves a consistent view of a distributed application's state on stable storage. The traditional consistent checkpointing algorithms require different processes to save their state at about the same time. This causes contention for the stable storage, potentially resulting in large overheads. Staggering the checkpoints taken by various processes can reduce checkpoint overhead. This paper presents a simple approach to arbitrarily stagger the checkpoints. Our… CONTINUE READING
Highly Cited
This paper has 30 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 20 extracted citations

An optimistic checkpointing and selective message logging approach for consistent global checkpoint collection in distributed systems

2007 IEEE International Parallel and Distributed Processing Symposium • 2007
View 1 Excerpt
Highly Influenced

Checkpointing Using Mobile Agents in Distributed Systems

2007 International Conference on Computing: Theory and Applications (ICCTA'07) • 2007
View 2 Excerpts


Publications referenced by this paper.
Showing 1-10 of 23 references

aEfficient Checkpointing on MIMD Architectures,o

J. S. Plank
PhD thesis, • 1993
View 20 Excerpts
Highly Influenced

aCheckpoint-Based Forward Recovery Using Lookahead Execution and Rollback Validation in Parallel and Distributed Systems,o

J. Long
PhD thesis, Univ. of Illinois-Urbana, • 1992
View 5 Excerpts
Highly Influenced

Snapshots: Determining Global States in Distributed Systems,o

K. M. Chandy, L. Lamport, aDistributed
ACM Trans. Computer Systems, • 1985
View 8 Excerpts
Highly Influenced

aDistributed System Fault Tolerance Using Message Logging and Checkpointing,o

D. B. Johnson
PhD thesis, • 1989
View 4 Excerpts
Highly Influenced

aCLIP: A Checkpointing Tool for Message-Passing Parallel Programs,o

Y. Chen, J. S. Plank, K. Li
Proc. SC97: High Performance Networking and Computing, • 1997
View 1 Excerpt

aA Survey of Rollback-Recovery Protocols in Message-Passing Systems,o

E. Elnozahy, D. B. Johnson, Y. M. Wang
Technical Report CMU-CS-96-181, Carnegie Mellon Univ., • 1996
View 2 Excerpts

Similar Papers

Loading similar papers…