Transparent Fault-tolerance in Parallel Orca Programs

Abstract

With the advent of large-scale parallel computing systems, making parallel programs fault-tolerant becomes an important problem, because the probability of a failure increases with the number of processors. In this paper, we describe a very simple scheme for rendering a class of parallel Orca programs fault-tolerant. Also, we discuss our experience with…

2 Figures and Tables
