@article{Hayashibara2004TheP,
title={The {$\phi$} accrual failure detector},
author={Naohiro Hayashibara and Xavier D{\'e}fago and Rami Yared and Takuya Katayama},
journal={Proceedings of the 23rd IEEE International Symposium on Reliable Distributed Systems},
year={2004},
pages={66--78}
}
The detection of failures is a fundamental issue for fault-tolerance in distributed systems. Recently, many people have come to realize that failure detection ought to be provided as some form of generic service, similar to IP address lookup or time synchronization. However, this has not been successful so far; one of the reasons being the fact that classical failure detectors were not designed to satisfy several application requirements simultaneously. We present a novel abstraction, called…
Performance evaluation of a failure detector using SNMP
M. Müller
Master's thesis, École Polytechnique Fédérale de Lausanne, Switzerland, Feb. 2004. [Online]. Available: http://lsewww.epfl.ch/Publications/ById/366.html