Failure detector

In distributed computing, a failure detector is an application or a subsystem that is responsible for detection of node failures or crashes in a…
Wikipedia

Papers overview

Highly Cited
2004
Detecting failures is a fundamental issue for fault-tolerance in distributed systems. Recently, many people have come to realize…
Highly Cited
2003
Unreliable failure detectors introduced by Chandra and Toueg are abstract mechanisms that provide information on process failures…
Highly Cited
2002
Chandra and Toueg introduced the concept of unreliable failure detectors. They showed how, by adding these detectors to an…
Highly Cited
2001
Process groups in distributed applications and services rely on failure detectors to detect process failures completely, and as…
Highly Cited
1999
We consider partitionable networks with process crashes and lossy links, and focus on the problems of reliable communication and…
Highly Cited
1999
This paper addresses the Consensus problem in asynchronous distributed systems (made of n processes, at most f of them may crash…
Highly Cited
1997
We study failure detectors in asynchronous environments using a novel generic formulation of failure detection properties which…
Highly Cited
1997
We study the problem of achieving reliable communication with quiescent algorithms (i.e., algorithms that eventually stop sending…
Highly Cited
1997
Consensus is one of the most fundamental problems in the context of fault-tolerant distributed computing. The problem consists…
Highly Cited
1996
We introduce the concept of unreliable failure detectors and study how they can be used to solve Consensus in asynchronous…