Efficient Techniques for Fast Nested Barrier Synchronization


Two hardware barrier synchronization schemes are presented which can support deep levels of control nesting in data parallel programs. Hardware barriers are usually an order of magnitude faster than software implementations. Since large data parallel programs often have several levels of nested barriers, these schemes provide significant speedups in the execution of such programs on MIMD computers. The first scheme performs code transformations and uses two single-bit-trees to implement unlimited levels of nested barriers. However, this scheme increases the code size. The second scheme uses a more expensive integer-tree to support an exponential number of nested barriers without increasing the code size. Using hardware already available on commercial MIMD computers, this scheme can support more than four billion levels of nesting.

DOI: 10.1145/215399.215436

Extracted Key Phrases

8 Figures and Tables

Cite this paper

@inproceedings{Ramakrishnan1995EfficientTF, title={Efficient Techniques for Fast Nested Barrier Synchronization}, author={Vara Ramakrishnan and Isaac D. Scherson and Raghu Subramanian}, booktitle={SPAA}, year={1995} }