Using Hidden Semi-Markov Models for Effective Online Failure Prediction

Felix Salfner and Miroslaw Malek
2007 26th IEEE International Symposium on Reliable Distributed Systems (SRDS 2007)
Proactive handling of faults requires that the risk of upcoming failures be continuously assessed. One promising approach is online failure prediction, in which the current state of the system is evaluated in order to predict the occurrence of failures in the near future. More specifically, we focus on methods that use event-driven sources such as errors. We use hidden semi-Markov models (HSMMs) for this purpose and demonstrate effectiveness based on field data of a commercial…
This paper has highly influenced 10 other papers.


Publications citing this paper.

PreFix: Switch Failure Prediction in Datacenter Networks

POMACS • 2018

Towards Identifying the Best Variables for Failure Prediction Using Injection of Realistic Software Faults

2010 IEEE 16th Pacific Rim International Symposium on Dependable Computing • 2010


Hard drive failure prediction using Decision Trees

Rel. Eng. & Sys. Safety • 2017

Syslog processing for switch failure diagnosis and prediction in datacenter networks

2017 IEEE/ACM 25th International Symposium on Quality of Service (IWQoS) • 2017

Seer: A Lightweight Online Failure Prediction Approach

IEEE Transactions on Software Engineering • 2016

Semantic Scholar estimates that this publication has 131 citations based on the available data.



