Exploiting virtual synchrony in distributed systems

@inproceedings{Birman1987ExploitingVS,
  title={Exploiting virtual synchrony in distributed systems},
  author={Kenneth P. Birman and Thomas A. Joseph},
  booktitle={SOSP '87},
  year={1987}
}
We describe applications of a virtually synchronous environment for distributed programming, which underlies a collection of distributed programming tools in the ISIS2 system. A virtually synchronous environment allows processes to be structured into process groups, and makes events like broadcasts to the group as an entity, group membership changes, and even migration of an activity from one place to another appear to occur instantaneously — in other words, synchronously. A major advantage to… 

Figures and Tables from this paper

Exploiting replication in distributed systems
Techniques are examined for replicating data and execution in directly distributed systems: systems in which multiple processes interact directly with one another while continuously respecting
Overcoming Failures in a Distributed System
TLDR
This chapter focuses on mechanisms for replicating data and computation while guaranteeing some form of consistent behavior to the end-user through the components of Lamport’s widely known Paxos protocol.
Uniform actions in asynchronous distributed systems
TLDR
The results demonstrate that in contrast to Consensus, which cannot be solved in asynchronous systems with even a single faulty process, dynamic uniformity can be solved using a failure detection mechanism that makes bounded numbers of mistakes.
AD-A 243 512 Maintaining Consistency in Distributed SysteMS *
TLDR
This work proposes an integrated approach that permits applications that use virtual synchrony to with concurrent objects that respect a linearizability constraint, and vice versa, and Transactional subsystems are treated as a special case of linearIZability.
Pattern-Based Composition and Analysis of Virtually Synchronized Real-Time Distributed Systems
TLDR
This paper introduces a formal, complexity-reducing architectural pattern, called Multi-Rate PALS system, to support virtual synchronization in multi-rate distributed computations, and verifies the logical synchronization guarantee of this pattern.
Process Membership in Asynchronous Environments
TLDR
The solution is proposed, called Strong-GMP, which can be supported through a highly efficient protocol, and has been implemented as part of a distributed systems software project at Cornell University.
Integrating Runtime Consistency Models for Distributed Computing
  • K. Birman
  • Computer Science
    J. Parallel Distributed Comput.
  • 1994
TLDR
This work proposes an integrated approach that permits applications that use virtual synchrony to interact with concurrent objects that respect a linearizability constraint, and vice versa.
An Analysis of a Virtually Synchronous Protocol
TLDR
A corrected version of the CBCAST protocol, developed by Birman, Schiper and Stephenson in 1991 and used in their ISIS platform, is described and its formal properties are proved.
Robust and Efficient Replication Using Group Communication
TLDR
A new architecture and algorithm for distributed replicated database systems that exploits a group communication service to minimize communication costs and to eliminate forced disk writes in the critical path, while preserving complete and consistent operation.
Virtually Synchronous Methodology for Dynamic Service Replication
TLDR
This paper explores the topic of recovery from failure in the context of a dynamic reconfiguration model of the author's own design that unifies two widely popular prior approaches to the problem: virtual synchrony, a model and associated protocols for reliable group communication, and state machine replication (in particular, Paxos), a model of deterministic functionality specified as an event-driven state machine.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 36 REFERENCES
Programming with Shared Bulletin Boards in Asynchronus Distributed Systems
TLDR
This paper formalizes the notion of consistent behavior when unreliable processes concurrently access a bulletin board and provides a mechanism for reasoning about consistency in distributed systems, which was previously lacking.
Low cost management of replicated data in fault-tolerant distributed systems
TLDR
A technique is described that relaxes the usual degree of synchronization, permitting replicated data items to be updated concurrently with other operations, while at the same time ensuring that correctness is not violated, which results in better response time when performing operations on replicated data.
Replicated distributed programs
TLDR
A new software architecture for fault-tolerant distributed programs is presented that allows replication to be added transparently and flexibly to existing programs, and integration of the replication mechanisms into current programming languages is accomplished by means of stub compilers.
Synchronization in Distributed Programs
TLDR
The technique can be used to solve synchronization problems directly, to implement new synchronization mechanisms, and to construct distributed versions of existing synchronization mechanisms.
Replication and fault-tolerance in the ISIS system
TLDR
The tedmiques for obtaining a fault-tolerant implementation from a now distributed specification and for achieving improved performanc~ by concurrently updating replicated data are discussed.
Reliable communication in the presence of failures
TLDR
A review of several uses for the protocols in the ISIS system, which supports fault-tolerant resilient objects and bulletin boards, illustrates the significant simplification of higher level algorithms made possible by the approach.
Nested Transactions: An Approach to Reliable Distributed Computing
TLDR
The method for implementing nested transactions is novel in that it uses locking for concurrency control and the necessary algorithms for locking, recovery, distributed commitment, and distributed deadlock detection for a nested transaction system are presented.
Virtual time
Virtual time is a new paradigm for organizing and synchronizing distributed systems which can be applied to such problems as distributed discrete event simulation and distributed database concurrency
REPLICATION METHODS FOR ABSTRACT DATA TYPES
TLDR
A new method for managing replicated data that makes use of type-specific properties of objects to provide more effective replication and can realize a wider range of availability properties, more concurrency, more flexible reconfiguration, and better tolerance of partitions than existing replication methods are introduced.
Replication and fault-tolerance in the ISIS system
TLDR
Techniques for obtaining a fault-tolerant implementation from a non-distributed specification and for achieving improved performance by concurrently updating replicated data are discussed.
...
1
2
3
4
...