• Publications
Exploring Traditional and Emerging Parallel Programming Models Using a Proxy Application
Parallel machines are becoming more complex with increasing core counts and more heterogeneous architectures. However, the commonly used parallel programming models, C/C++ with MPI and/or OpenMP, …
  • 148
  • 29
  • Open Access
Hierarchical Load Balancing for Charm++ Applications on Large Supercomputers
Large parallel machines with hundreds of thousands of processors are being built. Recent studies have shown that ensuring good load balance is critical for scaling certain classes of parallel …
  • 64
  • 7
  • Open Access
Scalable molecular dynamics with NAMD on the IBM Blue Gene/L system
NAMD (nanoscale molecular dynamics) is a production molecular dynamics (MD) application for biomolecular simulations that include assemblages of proteins, cell membranes, and water molecules. In a …
  • 84
  • 5
  • Open Access
There goes the neighborhood: Performance degradation due to nearby jobs
Predictable performance is important for understanding and alleviating application performance issues; quantifying the effects of source code, compiler, or system software changes; estimating the …
  • 134
  • 5
  • Open Access
Automated mapping of regular communication graphs on mesh interconnects
Network contention has a significantly adverse effect on the performance of parallel applications with increasing size of parallel machines. Machines of the petascale era are forcing application …
  • 56
  • 5
  • Open Access
Dynamic topology aware load balancing algorithms for molecular dynamics applications
Molecular Dynamics applications enhance our understanding of biological phenomena through bio-molecular simulations. Large-scale parallelization of MD simulations is challenging because of the small …
  • 89
  • 4
  • Open Access
Mapping applications with collectives over sub-communicators on torus networks
The placement of tasks in a parallel application on specific nodes of a supercomputer can significantly impact performance. Traditionally, this task mapping has focused on reducing the distance …
  • 54
  • 4
  • Open Access
Avoiding hot-spots on two-level direct networks
A low-diameter, fast interconnection network is going to be a prerequisite for building exascale machines. A two-level direct network has been proposed by several groups as a scalable design for …
  • 66
  • 4
  • Open Access
Heuristic-Based Techniques for Mapping Irregular Communication Graphs to Mesh Topologies
  • A. Bhatele, L. Kalé
  • Computer Science
  • IEEE International Conference on High Performance…
  • 2 September 2011
Mapping of parallel applications on the network topology is becoming increasingly important on large supercomputers. Topology aware mapping can reduce the hops traveled by messages on the network and …
  • 29
  • 4
  • Open Access
Overcoming the Scalability Challenges of Epidemic Simulations on Blue Waters
Modeling dynamical systems represents an important application class covering a wide range of disciplines including but not limited to biology, chemistry, finance, national security, and health care.
  • 36
  • 4
  • Open Access