Agreement on the membership of a group of processes in a distributed system is a basic problem that arises in a wide range of applications. Such groups occur when a set of processes co-operate to…
FTCS-23 The Twenty-Third International Symposium…
1993
Failure detectors (or, more accurately, failure suspectors, or FS) appear to be a fundamental service upon which to build fault-tolerant, distributed applications. It is shown that an FS with very…
This paper derives necessary and sufficient communication for distributed applications that perform certain actions uniformly in asynchronous systems. We show there is an essential structure of…
The Nile system is a distributed environment for running very large, data-intensive applications across a network of commodity workstations. These applications process data from elementary particle…
This paper describes the design and implementation of a fault-tolerant CORBA naming service, CosNamingFT. Every CORBA object is accessed through its Interoperable Object Reference (IOR), which is…
The CLEO project [2], centered at Cornell University, is a large-scale high energy physics project. The goals of the project arise from an esoteric question: why is there apparently so…
The goal of the Nile project is to develop an inexpensive, scalable, fault-tolerant, widely distributed job processing environment. On the systems side, Nile must manage and provide transparent…