BLASX: A High Performance Level-3 BLAS Library for Heterogeneous Multi-GPU Computing

@inproceedings{Wang2016BLASXAH,
  title={BLASX: A High Performance Level-3 BLAS Library for Heterogeneous Multi-GPU Computing},
  author={Linnan Wang and Wei Wu and Jianxiong Xiao and Yi Yang},
  booktitle={ICS},
  year={2016}
}
Basic Linear Algebra Subprograms (BLAS) are a set of low-level linear algebra kernels widely adopted by applications in deep learning and scientific computing. The massive and economical computing power of emerging GPU architectures drives interest in implementing compute-intensive level 3 BLAS on multi-GPU systems. In this paper, we investigate existing multi-GPU level 3 BLAS and show that 1) issues such as improper load balancing, inefficient…
