CLBlast: A Tuned OpenCL BLAS Library

@inproceedings{Nugteren2018CLBlastAT,
  title={CLBlast: A Tuned OpenCL BLAS Library},
  author={Cedric Nugteren},
  booktitle={IWOCL},
  year={2018}
}
This work introduces CLBlast, an open-source BLAS library providing optimized OpenCL routines to accelerate dense linear algebra for a wide variety of devices. It is targeted at machine learning and HPC applications and thus provides a fast matrix-multiplication routine (GEMM) to accelerate the core of many applications (e.g. deep learning, iterative solvers, astrophysics, computational fluid dynamics, quantum chemistry). CLBlast has five main advantages over other OpenCL BLAS libraries: 1) it…
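
For readers unfamiliar with the GEMM routine named in the abstract: in standard BLAS terms it computes C ← αAB + βC for matrices A (m×k), B (k×n), and C (m×n). The sketch below is a plain-Python reference of that definition only. It is not CLBlast's API or its tuned OpenCL kernels, and the name `gemm_reference` is hypothetical, chosen for illustration.

```python
def gemm_reference(alpha, a, b, beta, c):
    """Return alpha * (a @ b) + beta * c for row-major lists of lists.

    Hypothetical helper illustrating the BLAS GEMM definition;
    it ignores transposition options and in-place updates.
    """
    m, k = len(a), len(a[0])
    n = len(b[0])
    # Shapes must agree: a is m x k, b is k x n, c is m x n.
    if len(b) != k or len(c) != m or any(len(row) != n for row in c):
        raise ValueError("incompatible matrix shapes")
    result = []
    for i in range(m):
        row = []
        for j in range(n):
            # Dot product of row i of a with column j of b.
            acc = 0.0
            for p in range(k):
                acc += a[i][p] * b[p][j]
            row.append(alpha * acc + beta * c[i][j])
        result.append(row)
    return result
```

The triple loop performs on the order of m·n·k multiply-adds, which is the work an optimized GEMM distributes across the parallel compute units of a device.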