Exploiting Application-Level Correctness for Low-Cost Fault Tolerance

@article{Li2008ExploitingAC,
  title={Exploiting Application-Level Correctness for Low-Cost Fault Tolerance},
  author={Xuanhua Li and Donald Yeung},
  journal={J. Instruction-Level Parallelism},
  year={2008},
  volume={10}
}
Traditionally, fault tolerance researchers have required architectural state to be numerically perfect for program execution to be correct. However, in many programs, even if execution is not 100% numerically correct, the program can still appear to execute correctly from the user’s perspective. Hence, whether a fault is unacceptable or benign may depend on the level of abstraction at which correctness is evaluated, with more faults being benign at higher levels of abstraction, i.e. at the user…
This paper has 19 citations.

Citations

Publications citing this paper.

Reliability analysis of on-chip communication architectures: An MPEG-2 video decoder case study

Microprocessors and Microsystems - Embedded Hardware Design • 2011

Comparative reliability analysis between AMBA and network-on-chip: An MPEG-2 case study

2009 IEEE International SOC Conference (SOCC) • 2009

Neural acceleration for GPU throughput processors

2015 48th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) • 2015

References

Publications referenced by this paper.

Application-Level Correctness and its Impact on Fault Tolerance

2007 IEEE 13th International Symposium on High Performance Computer Architecture • 2007

Characterization of Error-Tolerant Applications when Protecting Control Data

2006 IEEE International Symposium on Workload Characterization • 2006

Efficient techniques for clustering and scheduling onto embedded multiprocessors

IEEE Transactions on Parallel and Distributed Systems • 2006

Design and evaluation of hybrid fault-detection systems

32nd International Symposium on Computer Architecture (ISCA'05) • 2005
