Performance evaluation of OpenMP's target construct on GPUs - exploring compiler optimisations
@article{Hayashi2019PerformanceEO,
  title={Performance evaluation of OpenMP's target construct on GPUs - exploring compiler optimisations},
  author={Akihiro Hayashi and J. Shirako and Ettore Tiotto and R. Ho and Vivek Sarkar},
  journal={Int. J. High Perform. Comput. Netw.},
  year={2019},
  volume={13},
  pages={54--69}
}
OpenMP is a directive-based shared memory parallel programming model and has been widely used for many years. From OpenMP 4.0 onwards, GPU platforms are supported by extending OpenMP's high-level parallel abstractions with accelerator programming. This extension allows programmers to write GPU programs in standard C/C++ or Fortran languages, without exposing too many details of GPU architectures. However, such high-level programming models generally impose additional program optimisations on…
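To illustrate the kind of program the abstract refers to, here is a minimal sketch in C of a loop offloaded with OpenMP's target construct. The function name `vec_add` and its parameters are hypothetical and are not taken from the paper; the sketch only shows the directive style, not any of the compiler optimisations the paper evaluates.

```c
#include <stddef.h>

/* Hypothetical example, not from the paper: element-wise vector addition
 * offloaded with OpenMP's target construct (OpenMP 4.0 or later). */
void vec_add(const float *a, const float *b, float *c, size_t n)
{
    /* map(to:) copies the inputs to the device and map(from:) copies the
     * result back. When the compiler or system has no offloading support,
     * the loop simply runs on the host. */
    #pragma omp target teams distribute parallel for \
        map(to: a[0:n], b[0:n]) map(from: c[0:n])
    for (size_t i = 0; i < n; ++i) {
        c[i] = a[i] + b[i];
    }
}
```

The loop body is ordinary C; the single directive requests offloading and describes the data movement, which is the sense in which GPU details stay hidden from the programmer.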