• Publications
  • Influence
Applied Numerical Linear Algebra
  • J. Demmel
  • Computer Science, Mathematics
  • 1 September 1997
Preface 1. Introduction 2. Linear equation solving 3. Linear least squares problems 4. Nonsymmetric Eigenvalue problems 5. The symmetric Eigenproblem and singular value decomposition 6. IterativeExpand
  • 2,671
  • 190
  • Open Access
LAPACK Users' Guide, 3rd ed.
  • 922
  • 127
An updated set of basic linear algebra subprograms (BLAS)
L. SUSAN BLACKFORD Myricom, Inc. JAMES DEMMEL University of California, Berkeley JACK DONGARRA The University of Tennessee IAIN DUFF Rutherford Appleton Laboratory and CERFACS SVEN HAMMARLINGExpand
  • 558
  • 62
  • Open Access
A Supernodal Approach to Sparse Partial Pivoting
We investigate several ways to improve the performance of sparse LU factorization with partial pivoting, as used to solve unsymmetric linear systems. We introduce the notion of unsymmetric supernodesExpand
  • 803
  • 56
  • Open Access
Benchmarking GPUs to tune dense linear algebra
We present performance results for dense linear algebra using recent NVIDIA GPUs. Our matrix-matrix multiply routine (GEMM) runs up to 60% faster than the vendor's implementation and approaches theExpand
  • 481
  • 54
LAPACK User's Guide
  • 418
  • 52
IEEE Standard for Floating-Point Arithmetic
  • 1,009
  • 46
Accurate Singular Values of Bidiagonal Matrices
Computing the singular values of a bidiagonal matrix is the final phase of the standard algorithm for the singular value decomposition of a general matrix. A new algorithm that computes all the sin...
  • 360
  • 46
  • Open Access
LAPACK Users' Guide, Third Edition
Preface to the third edition Preface to the secondedition Part 1. Guide. 1. Essentials 2. Contents of LAPACK 3. Performance of LAPACK 4. Accuracy and Stability 5. Documentation and SoftwareExpand
  • 1,012
  • 45
SuperLU_DIST: A scalable distributed-memory sparse direct solver for unsymmetric linear systems
We present the main algorithmic features in the software package SuperLU_DIST, a distributed-memory sparse direct solver for large sets of linear equations. We give in detail our parallelizationExpand
  • 570
  • 45
  • Open Access