FIMD-MPI: A Tool for Injecting Faults into MPI Applications

Abstract

Parallel computing is seeing increasing use in critical applications. The need therefore arises to test the robustness of parallel applications in the presence of exceptional conditions, or faults. Communication-software-based fault injection is an extremely flexible approach to robustness testing in message-passing parallel computers. A fault injection methodology and tool that use this approach are presented. The tool, known as FIMD-MPI, allows injection of faults into MPI-based applications. The structure and operation of FIMD-MPI are described and the use of the tool is illustrated on an example fault-tolerant MPI application.

DOI: 10.1109/IPDPS.2000.845991

Cite this paper

@inproceedings{Blough2000FIMDMPIAT,
  title={FIMD-MPI: A Tool for Injecting Faults into MPI Applications},
  author={Douglas M. Blough and Peng Liu},
  booktitle={IPDPS},
  year={2000}
}