Hybrid CUDA, OpenMP, and MPI parallel programming on multicore GPU clusters

  title={Hybrid CUDA, OpenMP, and MPI parallel programming on multicore GPU clusters},
  author={Chao-Tung Yang and Chih-Lin Huang and Cheng-Fang Lin},
  journal={Computer Physics Communications},
a r t i c l e i n f o a b s t r a c t Nowadays, NVIDIA's CUDA is a general purpose scalable parallel programming model for writing highly parallel applications. It provides several key abstractions – a hierarchy of thread blocks, shared memory, and barrier synchronization. This model has proven quite successful at programming multithreaded many core GPUs and scales transparently to hundreds of cores: scientists throughout industry and academia are already using CUDA to achieve dramatic speedups… CONTINUE READING
Highly Cited
This paper has 61 citations. REVIEW CITATIONS
40 Citations
5 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 40 extracted citations

61 Citations

Citations per Year
Semantic Scholar estimates that this publication has 61 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-5 of 5 references

OpenGL(R) Programming Guide: The Official Guide to Learning OpenGL(R), Version 2.1, 6th edition, Addison–Wesley Professional

  • Board, Dave Shreiner, Mason Woo, Jackie Neider, Tom Davis
  • OpenGL Architecture
  • 2007
1 Excerpt

Similar Papers

Loading similar papers…