A New Approach to System-Level Fault-Tolerance in Message-Passing MultiComputers

The loop is a commonly used interconnection network for computer systems. In this paper we consider the problem of making a loop network fault-tolerant. Previous solutions employ the absolute minimum number of redundant components, for a specified level of fault tolerance. In our approach, "extra" redundancy is used to reduce the size and complexity of the… (More)