Learn More
With the advent of new routing methods, the distance to which a message is sent is becoming relatively less and less important. Thus, assuming no link contention, permutation seems to be an eecient collective communication primitive. In this paper we present several algorithms for decomposing all-to-many personalized communication into a set of disjoint(More)
Supporting source-level performance analysis of programs written in data-parallel languages requires a unique degree of integration between compilers and performance analysis tools. Compilers for languages such as High Performance Fortran infer parallelism and communication from data distribution directives, thus, performance tools cannot meaningfully(More)
With the advent of new routing methods, the distance to which a message is sent is becoming relatively less and less important. Thus, assuming no link contention, permutation seems to be an eecient collective communication primitive. In this paper we present several algorithms for decomposing all-to-many personalized communication into a set of disjoint(More)
In this paper, we present several algorithms for per forming all-to-many personalized communication on distributed memory parallel machines. Each proces sor sends a different message (of potentially different size) to a subset of all the processors involved in the collective communication. The algorithms are based on decomposing the communication matrix(More)
In this paper, we present several algorithms for performing all-to-many personalized communication on distributed memory parallel machines. Each processor sends a diierent message (of potentially diierent size) to a subset of all the processors involved in the collective communication. The algorithms are based on decomposing the communication matrix into a(More)
A communication package, Non-uniform Irregular Communication Exchange (NICE), is designed to help users in scheduling message-passing requests on distributed-memory machines. This package schedules a batch of messages into a set of partial permutations and provides communication primitives to carry out the communication. The NICE primitives are fo-cused on(More)