An efficient logging algorithm for incremental replay of message-passing applications

@article{Zambonelli1999AnEL,
  title={An efficient logging algorithm for incremental replay of message-passing applications},
  author={Franco Zambonelli},
  journal={Proceedings 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing. IPPS/SPDP 1999},
  year={1999},
  pages={392-398}
}
To support incremental replay of message-passing applications, processes must periodically checkpoint and the content of some messages must be logged, to break dependencies of the current state of the execution on past events. The paper presents a new adaptive logging algorithm that dynamically decides whether to log a message based on dependencies the incoming message introduces on past events of the execution. The paper discusses the implementation issues of the algorithm and evaluates its… CONTINUE READING

Citations

Publications citing this paper.
SHOWING 1-10 OF 13 CITATIONS

Debugging Parallel Systems: A State of the Art Report

VIEW 7 EXCERPTS
CITES BACKGROUND
HIGHLY INFLUENCED

Constant Execution Time Recording for Replay of Sporadic Real-Time Systems

Joel Huselius, Henrik Thane Mälardalen
  • 2004
VIEW 4 EXCERPTS
CITES BACKGROUND & METHODS
HIGHLY INFLUENCED

Replay Debugging for Multi-threaded Embedded Software

  • 2010 IEEE/IFIP International Conference on Embedded and Ubiquitous Computing
  • 2010
VIEW 1 EXCERPT
CITES BACKGROUND

Hybrid checkpoint protocol for cell-dependent infrastructured networks

  • 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings.
  • 2004

References

Publications referenced by this paper.
SHOWING 1-10 OF 11 REFERENCES

Adaptive message logging for incremental program replay

  • IEEE Parallel & Distributed Technology: Systems & Applications
  • 1993
VIEW 6 EXCERPTS
HIGHLY INFLUENTIAL

On the effectiveness of distributed checkpoint algorithms for domino-free recovery

  • Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244)
  • 1998
VIEW 1 EXCERPT

Preventing useless checkpoints in distributed computations

  • Proceedings of SRDS'97: 16th IEEE Symposium on Reliable Distributed Systems
  • 1997
VIEW 2 EXCERPTS

Replaying distributed programs without message logging

  • Proceedings. The Sixth IEEE International Symposium on High Performance Distributed Computing (Cat. No.97TB100183)
  • 1997
VIEW 3 EXCERPTS

On-the-fly replay: a practical paradigm and its implementation for distributed debugging

  • Proceedings of 1994 6th IEEE Symposium on Parallel and Distributed Processing
  • 1994
VIEW 1 EXCERPT

Babaoglu and K . Marzullo . Consistent global states of distributed systems

M. Hurfin Gerstel, N. Plouzeau, M. Raynal, S. Zaks
  • 1993