Software Exploitation of a Fault-Tolerant Computer with a Large Memory

  title={Software Exploitation of a Fault-Tolerant Computer with a Large Memory},
  author={Frank Eskesen and Michel Hack and Arun Iyengar and Richard P. King and Nagui Halim},
The DM/6000 hardware (a prototype, fault-tolerant RS/6000 built at the TJ Watson Research Center) provides fault tolerance and a large, non-volatile main memory. Running a commercial, general-purpose operating system on it, of itself, does nothing to increase software availability. In fact, the time to rebuild the contents of a large memory may decrease availability. We describe our techniques for hiding most of the main memory, which requires the operating system to access it only by way of… CONTINUE READING


Publications referenced by this paper.
Showing 1-10 of 17 references

Evaluating HACMP/6000: A Clustering Solution for High Availability Distrib- uted Systems

  • G. Ahrens
  • In IEEE Conference on Fault- Tolerant Parallel…
  • 1994