Hans P. Reiser

Learn More
Proactive recovery is a promising approach for building fault and intrusion tolerant systems that tolerate an arbitrary number of faults during system lifetime. This paper investigates the benefits that a virtualization-based replication infrastructure can offer for implementing proactive recovery. Our approach uses the hypervisor to initialize a new(More)
Proactive recovery of replicated services is a novel approach that allows tolerating a potentially unlimited number of malicious faults during system lifetime by periodically restarting replicas from a correct state. Recovering a stateful replica requires a time-consuming transfer and verification of the state. During this time, the replica usually is(More)
The design of distributed applications based on a fragmented object model has many benefits. Unlike traditional middleware with a RPC-based client-server interaction, the implementation of a fragmented object may be distributed over an arbitrary number of fragments, without restrictions on internal structure or interaction, while maintaining a transparent,(More)
For the development of CORBA applications, an IDL compiler is needed that generates code for communication stubs, helper classes and implementation skeletons. For each IDL language mapping, for every version of a particular language mapping, and for every CORBA implementation, the generated code has to be different. Typically many different compilers have(More)
The goal of the FOREVER project is to develop a service for <i>Fault/intrusiOn REmoVal through Evolution & Recovery</i>. In order to achieve this goal, our work addresses three main tasks: the definition of the FOREVER service architecture; the analysis of how diversity techniques can improve resilience; and the evaluation of the FOREVER service. The(More)
Using distributed consensus algorithms is an established way to implement group communication systems with total message ordering. This paper presents a system architecture focusing on (re-)configurability of such systems. Unlike other systems, ours can be configured to work with different failure models and low-level communication protocols without(More)
Fault tolerance is becoming an increasingly important requirement in distributed systems. Many existing faulttolerant infrastructures assume a fail-stop behaviour, which, however, is too idealistic to be always valid in practice. While techniques such as checksums and error correcting codes on storage entities can compensate or detect some hardware faults,(More)
Despite the fact that cloud computing offers a high degree of dynamism on resource provisioning, there is a general lack of support for managing dynamic adaptations of replicated services in the cloud, and, even when such support exists, it is focused mainly on elasticity by means of horizontal scalability. We analyse the benefits a replicated service may(More)
CloudFIT is an ongoing project that designs an architecture for intrusiontolerant applications that can be deployed dynamically in the cloud. This position paper presents an outline of the architecture that is being developed in the project, and discusses the implications of the deployment in the cloud. We explore to what extent existing BFT algorithms can(More)