Toward multi-target autotuning for accelerators

@article{Chaimov2014TowardMA,
  title={Toward multi-target autotuning for accelerators},
  author={Nicholas Chaimov and Boyana Norris and Allen D. Malony},
  journal={2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)},
  year={2014},
  pages={534-541}
}
Producing high-performance implementations from simple, portable computation specifications is a challenge that compilers have tried to address for several decades. More recently, a relatively stable architectural landscape has evolved into a set of increasingly diverging and rapidly changing CPU and accelerator designs, with the main common factor being dramatic increases in the levels of parallelism available. The growth of architectural heterogeneity and parallelism, combined with the very… CONTINUE READING

Citations

Publications citing this paper.
SHOWING 1-9 OF 9 CITATIONS

AIWC: OpenCL-Based Architecture-Independent Workload Characterization

  • 2018 IEEE/ACM 5th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC)
  • 2018

Autotuning GPU Kernels via Static and Predictive Analysis

  • 2017 46th International Conference on Parallel Processing (ICPP)
  • 2017
VIEW 2 EXCERPTS
CITES BACKGROUND & METHODS

Tuning OpenCL Applications with the Periscope Tuning Framework

  • 2016 49th Hawaii International Conference on System Sciences (HICSS)
  • 2016
VIEW 1 EXCERPT
CITES BACKGROUND

References

Publications referenced by this paper.
SHOWING 1-10 OF 18 REFERENCES

Scalable parallel programming with CUDA

  • 2008 IEEE Hot Chips 20 Symposium (HCS)
  • 2008
VIEW 15 EXCERPTS
HIGHLY INFLUENTIAL

A parametric multi - level tiler for imperfect loop nests

M. M. B ASKARAN, C. B ASTOUL, +4 authors P. PrimeTile
  • 2013

AND SADAYAPPAN, P. Stencil-aware GPU optimization of iterative solvers

C. CHOUDARY, J. GODWIN, +5 authors G. SABIN
  • SIAM Journal on Scientific Computing
  • 2013
VIEW 1 EXCERPT

Autotuning Stencil-Based Computations on GPUs

  • 2012 IEEE International Conference on Cluster Computing
  • 2012
VIEW 1 EXCERPT

Annotation-based empirical performance tuning using Orio

  • 2009 IEEE International Symposium on Parallel & Distributed Processing
  • 2009
VIEW 1 EXCERPT

Similar Papers

Loading similar papers…