Terascale chip multiprocessor memory hierarchy and programming model

Abstract

Small-scale chip multiprocessors are being shipped in volume by all microprocessor vendors. Many of these vendors are also investigating large-scale chip multiprocessors targeted at highly parallel workloads in media, graphics, and other domains. One of the most challenging aspects of architecting terascale processors is the design of a scalable memory hierarchy. Current proposals for providing coherent shared memory in terascale systems require a sophisticated coherence protocol and memory hierarchy. In this paper we propose an alternative memory configuration, along with a programming model, that significantly simplifies the terascale memory hierarchy. Our proposal still provides fully coherent shared memory but eliminates the hardware coherence protocol. Our programming model enables the programmer to better express the memory characteristics of terascale workloads. Finally, our proposed memory hierarchy performs better and is more scalable than conventional designs.

DOI: 10.1109/HIPC.2009.5433215

Cite this paper

@article{Yan2009TerascaleCM, title={Terascale chip multiprocessor memory hierarchy and programming model}, author={Shoumeng Yan and Xiaocheng Zhou and Ying Gao and Hu Chen and Sai Luo and Peinan Zhang and Naveen Cherukuri and Ronny Ronen and Bratin Saha}, journal={2009 International Conference on High Performance Computing (HiPC)}, year={2009}, pages={150-159} }