Skip to search formSkip to main contentSkip to account menu

BLAS

Known as: AXPY, CGEMM, DGEMM 
Basic Linear Algebra Subprograms (BLAS) is a specification that prescribes a set of low-level routines for performing common linear algebra… 
Wikipedia (opens in a new tab)

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
2015
2015
This paper describes maxDNN, a computationally efficient convolution kernel for deep learning with the NVIDIA Maxwell GPU. maxDNN… 
2015
2015
OpenCL is an open standard to write parallel applications for heterogeneous computing systems. Since its usage is restricted to a… 
2012
2012
This paper presents results of an implementation of code generator for fast general matrix multiply (GEMM) kernels. When a set of… 
2011
2011
Achieving high-performance while reducing power consumption is a key concern as technology scaling is reaching its limits. It is… 
2011
2011
In recent years, the use of graphics chips has been recognized as a viable way of accelerating scientic and engineering… 
2007
2007
This paper presents a new practical tuning method for fractional order proportional and integral controller (FO-PI). The plant to… 
2005
2005
We describe the design of a dual-issue single-instruction, multiple-data-like (SIMD-like) extension of the IBM PowerPC® 440… 
2004
2004
In this paper, we extend the theory of algorithmic fault-tolerant matrix-matrix multiplication, C = AB, in a number of ways…