Accelerating scientific computations with mixed precision algorithms
@article{Baboulin2009AcceleratingSC, title={Accelerating scientific computations with mixed precision algorithms}, author={M. Baboulin and A. Buttari and J. Dongarra and J. Kurzak and J. Langou and Julien Langou and P. Luszczek and S. Tomov}, journal={Comput. Phys. Commun.}, year={2009}, volume={180}, pages={2526-2533} }
On modern architectures, the performance of 32-bit operations is often at least twice as fast as the performance of 64-bit operations. By using a combination of 32-bit and 64-bit floating point arithmetic, the performance of many dense and sparse linear algebra algorithms can be significantly enhanced while maintaining the 64-bit accuracy of the resulting solution. The approach presented here can apply not only to conventional processors but also to other technologies such as Field Programmable… CONTINUE READING
Figures, Tables, and Topics from this paper
145 Citations
A floating point conversion algorithm for mixed precision computations
- Computer Science
- Journal of Zhejiang University SCIENCE C
- 2012
Mixed-Precision Solution of Linear Systems Using Accelerator-Based Computing
- Computer Science
- 2020
- 5
- PDF
Investigating half precision arithmetic to accelerate dense linear system solvers
- Computer Science
- ScalA@SC
- 2017
- 34
- PDF
Towards Half-Precision Computation for Complex Matrices: A Case Study for Mixed Precision Solvers on GPUs
- Computer Science
- 2019 IEEE/ACM 10th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems (ScalA)
- 2019
- 2
- PDF
Toward a modular precision ecosystem for high-performance computing
- Computer Science
- Int. J. High Perform. Comput. Appl.
- 2019
- 3
- PDF
Towards numerical benchmark for half-precision floating point arithmetic
- Computer Science
- 2017 IEEE High Performance Extreme Computing Conference (HPEC)
- 2017
- 8
- PDF
Harnessing GPU Tensor Cores for Fast FP16 Arithmetic to Speed up Mixed-Precision Iterative Refinement Solvers
- Computer Science
- SC18: International Conference for High Performance Computing, Networking, Storage and Analysis
- 2018
- 76
- PDF
Implications of Reduced-Precision Computations in HPC: Performance, Energy and Error
- Computer Science
- PARCO
- 2017
- 14
- PDF
Increasing Accuracy of Iterative Refinement in Limited Floating-Point Arithmetic on Half-Precision Accelerators
- Computer Science
- 2019 IEEE High Performance Extreme Computing Conference (HPEC)
- 2019
- 1
- PDF
References
SHOWING 1-10 OF 105 REFERENCES
Using Mixed Precision for Sparse Matrix Computations to Enhance the Performance while Achieving 64-bit Accuracy
- Computer Science
- TOMS
- 2008
- 86
- PDF
Mixed Precision Iterative Refinement Techniques for the Solution of Dense Linear Systems
- Computer Science
- Int. J. High Perform. Comput. Appl.
- 2007
- 121
- PDF
Exploiting the Performance of 32 bit Floating Point Arithmetic in Obtaining 64 bit Accuracy (Revisiting Iterative Refinement for Linear Systems)
- Mathematics, Computer Science
- ACM/IEEE SC 2006 Conference (SC'06)
- 2006
- 141
- PDF
Solving Systems of Linear Equations on the CELL Processor Using Cholesky Factorization
- Computer Science
- IEEE Transactions on Parallel and Distributed Systems
- 2008
- 127
- PDF
Implementation of mixed precision in solving systems of linear equations on the Cell processor
- Computer Science
- Concurr. Comput. Pract. Exp.
- 2007
- 74
- PDF
Exploiting fast hardware floating point in high precision computation
- Mathematics, Computer Science
- ISSAC '03
- 2003
- 21
A Fully Asynchronous Multifrontal Solver Using Distributed Dynamic Scheduling
- Computer Science, Mathematics
- SIAM J. Matrix Anal. Appl.
- 2001
- 1,621
- PDF
Algorithm 656: an extended set of basic linear algebra subprograms: model implementation and test programs
- Computer Science
- TOMS
- 1988
- 209
- PDF
Algorithm 679: A set of level 3 basic linear algebra subprograms: model implementation and test programs
- Computer Science, Mathematics
- TOMS
- 1990
- 283
- PDF