• Publications
  • Influence
The Perfect Club Benchmarks: Effective Performance Evaluation of Supercomputers
TLDR
A methodology for measuring the performance of supercomputers, including 13 Fortran programs that total over 50,000 lines of source code, and a set of guidelines that allow portability to several types of machines are presented. Expand
The design of linear algebra libraries for high performance computers
This paper discusses the design of linear algebra libraries for high performance computers. Particular emphasis is placed on the development of scalable algorithms for MIMD distributed memoryExpand
A look at scalable dense linear algebra libraries
Discusses the essential design features of a library of scalable software for performing dense linear algebra computations on distributed memory concurrent computers. The square block scatteredExpand
The design library: of a parallel dense linear algebra software Reduction to Hessenberg, tridiagonal, and bidiagonal form*
TLDR
There is a tradeoff between efficiency and software engineering considerations, such as ease of programming and simplicity of code, in the design of ScaLAPACK, a software library for performing dense linear algebra computations on distributed memory concurrent computers. Expand
Benchmarking Advanced Architecture Computers
TLDR
The Crays offer the best performance on the benchmark suite; the shared memory multiprocessor machines generally permitted some parallelism, and when coupled with substantial floating point capabilities (as in the Alliant FX/8 and Sequent Symmetry), provided an order of magnitude less speed than the CRAYs. Expand
A comparison of the performance of the Caltech Mark 2 hypercube and the Elxsi 6400
TLDR
The implementation of the pre-conditioned, conjugate gradient (pcg) algorithm on both the Caltech hypercube and the Elxsi 6400 are discussed and it was found that on both machines the pcg part of the code runs with efficiency greater than 90% for all problems large enough to be of interest. Expand
Theory and Algorithms for Parallel Computation
TLDR
The Theory and Algorithms sessions at EuroPar '98 will cover a number of important issues concerning models of parallel computation including the BSP model and its relationship to LogP. Expand
A portable programming environment for multiprocessors
This paper discusses the issues involved in designing a portable programming environment for large grain-size MIMD concurrent processors. In addition, a detailed description of such an environment isExpand
...
1
2
...