Corpus ID: 18964633

SWAT: An Error Resilient System

@inproceedings{Li2008SWATA,
  title={SWAT: An Error Resilient System},
  author={Man-Lap Li and Pradeep Ramachandran and S. Sahoo and S. Adve and V. Adve and Yuanyuan Zhou},
  year={2008}
}
As devices continue to scale, future shipped hardware is more likely to fail due to in-the-field hardware faults. As traditional redundancy-based hardware reliability solutions are too expensive to be broadly deployable, recent research has focused on low-overhead reliability solutions. One approach is to employ low-overhead, always-on detection techniques that catch high-level symptoms and to pay a higher overhead for diagnosis, which is rarely invoked. To this end, we are developing SWAT (SoftWare…
An Accurate Cross-Layer Approach for Online Architectural Vulnerability Estimation
A Software Solution for Hardware Vulnerabilities
An algorithmic approach to error localization and partial recomputation for low-overhead fault tolerance

References

AR-SMT: a microarchitectural approach to fault tolerance in microprocessors
  • E. Rotenberg
  • Computer Science
  • Digest of Papers. Twenty-Ninth Annual International Symposium on Fault-Tolerant Computing (Cat. No.99CB36352)
  • 1999
A mechanism for online diagnosis of hard faults in microprocessors
Perturbation-based Fault Screening
DIVA: a reliable substrate for deep submicron microarchitecture design
  • T. Austin
  • Computer Science
  • MICRO-32. Proceedings of the 32nd Annual ACM/IEEE International Symposium on Microarchitecture
  • 1999
LLVM: a compilation framework for lifelong program analysis & transformation
  • Chris Lattner, V. Adve
  • Computer Science
  • International Symposium on Code Generation and Optimization, 2004. CGO 2004.
  • 2004
Designing reliable systems from unreliable components: the challenges of transistor variability and degradation