Automatic model-driven recovery in distributed systems

@article{Joshi2005AutomaticMR,
  title={Automatic model-driven recovery in distributed systems},
  author={Kaustubh R. Joshi and William H. Sanders and Matti A. Hiltunen and Richard D. Schlichting},
  journal={24th IEEE Symposium on Reliable Distributed Systems (SRDS'05)},
  year={2005},
  pages={25-36}
}
Automatic system monitoring and recovery has the potential to provide a low-cost solution for high availability. However, automating recovery is difficult in practice because of the challenge of accurate fault diagnosis in the presence of low coverage, poor localization ability, and false positives that are inherent in many widely used monitoring techniques. In this paper, we present a holistic model-based approach that overcomes these challenges and enables automatic recovery in distributed… CONTINUE READING

Citations

Publications citing this paper.
SHOWING 1-10 OF 41 CITATIONS

Software self-recovery method of AUV based on micro-reboot

  • 2011 9th World Congress on Intelligent Control and Automation
  • 2011
VIEW 6 EXCERPTS
CITES BACKGROUND
HIGHLY INFLUENCED

Automatic Recovery Using Bounded Partially Observable Markov Decision Processes

  • International Conference on Dependable Systems and Networks (DSN'06)
  • 2006
VIEW 4 EXCERPTS
CITES METHODS

Scalable Rollback for Cloud Operations Using AI Planning

  • 2015 24th Australasian Software Engineering Conference
  • 2015
VIEW 1 EXCERPT
CITES METHODS

References

Publications referenced by this paper.
SHOWING 1-10 OF 17 REFERENCES