Autogeneration and Autotuning of 3D Stencil Codes on Homogeneous and Heterogeneous GPU Clusters

@article{Zhang2013AutogenerationAA,
  title={Autogeneration and Autotuning of 3D Stencil Codes on Homogeneous and Heterogeneous GPU Clusters},
  author={Yongpeng Zhang and Frank Mueller},
  journal={IEEE Transactions on Parallel and Distributed Systems},
  year={2013},
  volume={24},
  pages={417-427}
}
This paper develops and evaluates search and optimization techniques for autotuning 3D stencil (nearest neighbor) computations on GPUs. Observations indicate that parameter tuning is necessary for heterogeneous GPUs to achieve optimal performance with respect to a search space. Our proposed framework takes a most concise specification of stencil behavior from the user as a single formula, autogenerates tunable code from it, systematically searches for the best configuration and generates the… CONTINUE READING
Highly Cited
This paper has 29 citations. REVIEW CITATIONS

13 Figures & Tables

Topics

Statistics

051015201620172018
Citations per Year

Citation Velocity: 7

Averaging 7 citations per year over the last 3 years.

Learn more about how we calculate this metric in our FAQ.