Orchestrated scheduling and prefetching for GPGPUs

@inproceedings{Jog2013OrchestratedSA,
  title={Orchestrated scheduling and prefetching for GPGPUs},
  author={Adwait Jog and Onur Kayiran and Asit K. Mishra and Mahmut T. Kandemir and Onur Mutlu and Ravi R. Iyer and Chita R. Das},
  booktitle={ISCA},
  year={2013}
}
In this paper, we present techniques that coordinate the thread scheduling and prefetching decisions in a General Purpose Graphics Processing Unit (GPGPU) architecture to better tolerate long memory latencies. We demonstrate that existing warp scheduling policies in GPGPU architectures are unable to effectively incorporate data prefetching. The main reason is that they schedule consecutive warps, which are likely to access nearby cache blocks and thus prefetch accurately for one another, back… CONTINUE READING
Highly Cited
This paper has 159 citations. REVIEW CITATIONS

Citations

Publications citing this paper.
Showing 1-10 of 108 extracted citations

Cloud Computing and Big Data

Lecture Notes in Computer Science • 2015
View 11 Excerpts
Highly Influenced

DRAW: investigating benefits of adaptive fetch group size on GPU

2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) • 2015
View 13 Excerpts
Highly Influenced

Locality-Aware CTA Clustering for Modern GPUs

ASPLOS • 2017
View 7 Excerpts
Highly Influenced

CTA-Aware Prefetching and Scheduling for GPU

2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS) • 2018

159 Citations

02040'14'16'18
Citations per Year
Semantic Scholar estimates that this publication has 159 citations based on the available data.

See our FAQ for additional information.

References

Publications referenced by this paper.
Showing 1-9 of 9 references

Many-Thread Aware Prefetching Mechanisms for GPGPU Applications

2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture • 2010
View 4 Excerpts
Highly Influenced

M

T. G. Rogers
O’Connor, and T. M. Aamodt. Cache-Conscious Wavefront Scheduling. In MICRO • 2012
View 3 Excerpts
Highly Influenced

CudaDMA: Optimizing GPU memory bandwidth via warp specialization

2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC) • 2011
View 5 Excerpts
Highly Influenced

Programming Massively Parallel Processors. A Hands-on Approach

Scalable Computing: Practice and Experience • 2010
View 4 Excerpts
Highly Influenced

Rodinia: A benchmark suite for heterogeneous computing

2009 IEEE International Symposium on Workload Characterization (IISWC) • 2009
View 4 Excerpts
Highly Influenced

Mars: A MapReduce Framework on graphics processors

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) • 2008
View 4 Excerpts
Highly Influenced

Similar Papers

Loading similar papers…