Efficient utilization of GPGPU cache hierarchy

  title={Efficient utilization of GPGPU cache hierarchy},
  author={Mahmoud Khairy and Mohamed Zahran and Amr G. Wassal},
Recent GPUs are equipped with general-purpose L1 and L2 caches in an attempt to reduce memory bandwidth demand and improve the performance of some irregular GPGPU applications. However, due to the massive multithreading, GPGPU caches suffer from severe resource contention and low data-sharing which may degrade the performance instead. In this work, we propose three techniques to efficiently utilize and improve the performance of GPGPU caches. The first technique aims to dynamically detect and… CONTINUE READING
Highly Cited
This paper has 20 citations. REVIEW CITATIONS
14 Citations
14 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 14 extracted citations


Publications referenced by this paper.
Showing 1-10 of 14 references

and M

  • W. Jia, K. A. Shaw
  • a. Martonosi. MRPB: Memory Request Prioritization…
  • 2014
Highly Influential
20 Excerpts


  • T. G. Rogers
  • O’Connor, and T. M. Aamodt. Divergence-aware warp…
  • 2013
Highly Influential
13 Excerpts

Similar Papers

Loading similar papers…