CLTune: A Generic Auto-Tuner for OpenCL Kernels

@article{Nugteren2015CLTuneAG,
  title={CLTune: A Generic Auto-Tuner for OpenCL Kernels},
  author={Cedric Nugteren and Valeriu Codreanu},
  journal={2015 IEEE 9th International Symposium on Embedded Multicore/Many-core Systems-on-Chip},
  year={2015},
  pages={195-202}
}
This work presents CLTune, an auto-tuner for OpenCL kernels. It evaluates and tunes kernel performance of a generic, user-defined search space of possible parameter-value combinations. Example parameters include the OpenCL workgroup size, vector data-types, tile sizes, and loop unrolling factors. CLTune can be used in the following scenarios: 1) when there are too many tunable parameters to explore manually, 2) when performance portability across OpenCL devices is desired, or 3) when the… CONTINUE READING
Highly Cited
This paper has 23 citations. REVIEW CITATIONS
Recent Discussions
This paper has been referenced on Twitter 7 times over the past 90 days. VIEW TWEETS

Citations

Publications citing this paper.

References

Publications referenced by this paper.
Showing 1-10 of 23 references

et al

  • J. Bergstra, F. Bastien, +7 authors A. Bergeron
  • Theano: Deep Learning on GPUs with Python. In…
  • 2011
Highly Influential
4 Excerpts

Similar Papers

Loading similar papers…