Algorithm 656: an extended set of basic linear algebra subprograms: model implementation and test programs

@article{Dongarra1988Algorithm6A,
  title={Algorithm 656: an extended set of basic linear algebra subprograms: model implementation and test programs},
  author={Jack J. Dongarra and Jeremy Du Croz and Sven Hammarling and Richard J. Hanson},
  journal={ACM Trans. Math. Softw.},
  year={1988},
  volume={14},
  pages={18-32}
}
This paper describes a model implementation and test software for the Level 2 Basic Linear Algebra Subprograms (Level 2 BLAS). Level 2 BLAS are targeted at matrix-vector operations with the aim of providing more efficient, but portable, implementations of algorithms on high-performance computers. The model implementation provides a portable set of FORTRAN 77 Level 2 BLAS for machines where specialized implementations do not exist or are not required. The test software aims to verify that… 

Tables from this paper

Algorithm 679: A set of level 3 basic linear algebra subprograms: model implementation and test programs
TLDR
A model implementation and test software for the Level 2 Basic Linear Algebra Subprograms, targeted at matrix-vector operations, provide a portable set of FORTRAN 77 Level 2 BLAS for machines where specialized implementations do not exists or are not required.
Preface to the special issue on the basic linear algebra subprograms (BLAS)
TLDR
In this issue of the ACM Transactions on Mathematical Software, a set of research contributions are collected together which provide some of the technical background behind the development of the new BLAS standard.
Local Basic Linear Algebra Subroutines (LBLAS) for the CM-5/5E
TLDR
The implementation considerations and performance of the local BLAS, or BLAS local to each node of the system, are presented, irrespective of the data layout in node mem ory.
An extended set of FORTRAN basic linear algebra subprograms
This paper describes an extension to the set of Basic Linear Algebra Subprograms. The extensions are targeted at matrix-vector operations that should provide for efficient and portable
A set of level 3 basic linear algebra subprograms
This paper describes an extension to the set of Basic Linear Algebra Subprograms. The extensions are targeted at matrix-vector operations that should provide for efficient and portable
A Proposal for a Set of Parallel Basic Linear Algebra Subprograms
TLDR
The PBLAS are targeted at distributed vector-vector, matrixvector and matrix-matrix operations with the aim of simplifying the parallelization of linear algebra codes, especially when implemented on top of the sequential BLAS.
The Multicomputer Toolbox Project Blais Working Note #0: Standard Sequential Mathematical Libraries: Promises and Pitfalls, Opportunities and Challenges
TLDR
A set of requirements for the next generation of standard libraries for numerical linear algebra are proposed, which are called the basic linear algebra instruction set (BLAIS), which draws from modern software design and architecture principles.
Design, implementation and testing of extended and mixed precision BLAS
TLDR
The design rationale, a C implementation, and conformance testing of a subset of the new Standard for the BLAS (Basic Linear Algebra Subroutines): Extended and Mixed Precision BLAS are described, which achieves excellent performance.
New trends in high performance computing
A Rational Approach to Portable High Performance: The Basic Linear Algebra Instruction Set (BLAIS) and the Fixed Algorithm Size Template (FAST) Library
We introduce the Basic Linear Algebra Instruction Set (BLAIS), a collection of high performance kernels for basic linear algebra that encapsulate small fixed size computations to provide building
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 10 REFERENCES
An extended set of FORTRAN basic linear algebra subprograms
This paper describes an extension to the set of Basic Linear Algebra Subprograms. The extensions are targeted at matrix-vector operations that should provide for efficient and portable
Transforming FORTRAN DO loops to improve performance on vector architectures
TLDR
The unrolling transformation and associated transformations of Fortran DO loops are characterized and a set of software tools to carry out these transformations are described, which use the machinery available in Toolpack and have been integrated into that environment.
Implementing Linear Algebra Algorithms for Dense Matrices on a Vector Pipeline Machine
TLDR
This paper examines common implementations of linear algebra algorithms, such as matrix-vector multiplication, matrix-matrix multiplication and the solution of linear equations for efficiency on a computer architecture which uses vector processing and has pipelined instruction execution.
Squeezing the most out of an algorithm in CRAY FORTRAN
This paper discusses a technique for achieving supervector performance on a Cray in a purely Fortran environment (i.e., without resorting to assembly language). The technique can be applied to a wide
ACM Transactions on Mathematical Software
  • ACM Transactions on Mathematical Software
  • 1988
Implementing linear algorithms for dense matrices on a vector pipeline machine
  • SIAM Rev
  • 1984
Implementing linear algorithms for dense matrices on a vector pipeline machine
  • SIAM Rev
  • 1984