Exploiting Fine-Grained Data Parallelism with Chip Multiprocessors and Fast Barriers

  title={Exploiting Fine-Grained Data Parallelism with Chip Multiprocessors and Fast Barriers},
  author={Jack Sampson and Rub{\'e}n Gonz{\'a}lez and Jean-Francois Collard and Norman P. Jouppi and Mike Schlansker and Brad Calder},
  journal={2006 39th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'06)},
We examine the ability of CMPs, due to their lower onchip communication latencies, to exploit data parallelism at inner-loop granularities similar to that commonly targeted by vector machines. Parallelizing code in this manner leads to a high frequency of barriers, and we explore the impact of different barrier mechanisms upon the efficiency of this approach. To further exploit the potential of CMPs for fine-grained data parallel tasks, we present barrier filters, a mechanism for fast barrier… CONTINUE READING
Highly Cited
This paper has 62 citations. REVIEW CITATIONS
40 Citations
8 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 40 extracted citations

63 Citations

Citations per Year
Semantic Scholar estimates that this publication has 63 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-8 of 8 references

Design and implementation of messagepassing services for the Blue Gene/L supercomputer

  • G. Almasi
  • IBM Journal of Research and Development,
  • 2005
Highly Influential
4 Excerpts

Similar Papers

Loading similar papers…