Share This Author
Anatomy of high-performance matrix multiplication
- Kazushige Goto, R. V. D. Geijn
- Computer ScienceTOMS
- 1 May 2008
We present the basic principles that underlie the high-performance implementation of the matrix-matrix multiplication that is part of the widely used GotoBLAS library. Design decisions are justified…
SUMMA: scalable universal matrix multiplication algorithm
- R. V. D. Geijn, J. Watts
- Computer ScienceConcurr. Pract. Exp.
- 1 April 1995
In this paper, we give a straight forward, highly efficient, scalable implementation of common matrix multiplication operations. The algorithms are much simpler than previously published methods,…
BLIS: A Framework for Rapidly Instantiating BLAS Functionality
- F. V. Zee, R. V. D. Geijn
- Computer ScienceACM Trans. Math. Softw.
- 1 June 2015
TLDR
High-performance implementation of the level-3 BLAS
- Kazushige Goto, R. V. D. Geijn
- Computer Science, EngineeringTOMS
- 22 July 2008
A simple but highly effective approach for transforming high-performance implementations on cache-based architectures of matrix-matrix multiplication into implementations of other commonly used…
Elemental: A New Framework for Distributed Memory Dense Matrix Computations
- J. Poulson, Bryan Marker, R. V. D. Geijn, J. Hammond, N. A. Romero
- Computer ScienceTOMS
- 1 February 2013
TLDR
Collective communication: theory, practice, and experience
- E. Chan, M. Heimlich, A. Purkayastha, R. V. D. Geijn
- PsychologyConcurr. Comput. Pract. Exp.
- 10 September 2007
TLDR
Using PLAPACK - parallel linear algebra package
- R. V. D. Geijn
- Computer Science
- 30 May 1997
Parallel Solution of Integral Equation-Based EM Problems in the Frequency Domain
- Yu Zhang, T. Sarkar, D. G. Donoro, H. Moon, M. Taylor, R. V. D. Geijn
- Computer Science
- 22 June 2009
TLDR
SuperMatrix: a multithreaded runtime scheduling system for algorithms-by-blocks
- E. Chan, F. V. Zee, P. Bientinesi, E. S. Quintana‐Ortí, G. Quintana-Ortí, R. V. D. Geijn
- Computer SciencePPoPP
- 20 February 2008
This paper describes SuperMatrix, a runtime system that parallelizes matrix operations for SMP and/or multi-core architectures. We use this system to demonstrate how code described at a high level of…
Supermatrix out-of-order scheduling of matrix operations for SMP and multi-core architectures
- E. Chan, E. S. Quintana‐Ortí, G. Quintana-Ortí, R. V. D. Geijn
- Computer ScienceSPAA '07
- 9 June 2007
TLDR
...
...