Replay For Concurrent Non-Deterministic Shared Memory Applications

Abstract

Replay of shared-memory program execution is desirable in many domains including cyclic debugging, fault tolerance and performance monitoring. Past approaches to repeatable execution have focused on the problem of re-executing the shared-memory access patterns in parallel programs. With the proliferation of operating system supported threads and shared memory for uniprocessor programs, there is a clear need for efficient replay of concurrent applications. The solutions for parallel systems can be performance prohibitive when applied to the uniprocessor case. We present an algorithm, called the repeatable scheduling algorithm, combining scheduling and instruction counts to provide an invariant for efficient, language independent replay of concurrent shared-memory applications. The approach is shown to have trace overheads that are independent of the amount of sharing that takes place. An implementation for cyclic debugging on Mach 3.0 is evaluated and benchmarks show typical performance overheads of around 10%. The algorithm implemented is compared with optimal event-based tracing and shown to do better with respect to the number of events monitored or number of events logged, in most cases by several orders of magnitude.

DOI: 10.1145/231379.231432

Extracted Key Phrases

7 Figures and Tables

Statistics

01020'97'99'01'03'05'07'09'11'13'15'17
Citations per Year

137 Citations

Semantic Scholar estimates that this publication has 137 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@inproceedings{Russinovich1996ReplayFC, title={Replay For Concurrent Non-Deterministic Shared Memory Applications}, author={Mark Russinovich and Bryce Cogswell}, booktitle={PLDI}, year={1996} }