We develop a model for the parallel performance of algorithms that consist of concurrent, two-dimensional wavefronts implemented in a message passing environment. The model, based on a LogGP machine parametrization, combines the separate contributions of computation and communication wavefronts. We validate the model on three important supercomputer systems …
The POEMS project is creating an environment for end-to-end performance modeling of complex parallel and distributed systems, spanning the domains of application software, runtime and operating system software, and hardware architecture. To enable end-to-end modeling of large-scale applications and systems, the POEMS framework is designed to compose models …
We develop a model for the parallel performance of algorithms that consist of concurrent, two-dimensional wavefronts implemented in a message passing environment. The model combines the separate contributions of computation and communication wavefronts. We validate the model on three supercomputer systems, with up to 500 processors, using data from an ASCI …
We present a model for the parallel performance of algorithms that consist of concurrent, two-dimensional wavefronts implemented in a message passing environment. The model combines the separate contributions of computation and communication wavefronts. We validate the model on three important supercomputer systems, on up to 500 processors. We use data …
… vector supercomputers … the Fujitsu VP-200, the Hitachi S810/20, and the NEC SX-2, which appeared as early as 1983. An early performance evaluation of the VP-200 and the S810/20, using benchmark codes designed to represent the Los Alamos National Laboratory (LANL) computing workload, showed that the VP-200 could be two to three times as fast as a …
In this paper, we compare the single-processor performance of the SGI Origin and PowerChallenge and use a previously reported performance model for hierarchical memory systems to explain the results. Both the Origin and PowerChallenge use the same microprocessor (MIPS R10000) but have significant differences in their memory subsystems. Our memory model …
In this paper, we characterize application performance with a "memory-centric" view. Using a simple strategy and performance data measured on actual machines, we model the performance of a simple memory hierarchy and infer the contribution of each level in the memory system to an application's overall cycles per instruction (CPI). Included are results …