• Publications
  • Influence
PSINS: An Open Source Event Tracer and Execution Simulator
As the size of today’s supercomputers grow exponentially in numbers of processors, the applications that run on these systems scale to larger processor counts. The majority of these applicationsExpand
  • 83
  • 9
PEBIL: Efficient static binary instrumentation for Linux
Binary instrumentation facilitates the insertion of additional code into an executable in order to observe or modify the executable's behavior. There are two main approaches to binaryExpand
  • 141
  • 8
Efficient instrumentation for code coverage testing
Evaluation of Code Coverage is the problem of identifying the parts of a program that did not execute in one or more runs of a program. The traditional approach for code coverage tools is to useExpand
  • 139
  • 5
NUMA-aware Java heaps for server applications
We introduce a set of techniques to both measure and optimize memory access locality of Java applications running on cc-NUMA servers. These techniques work at the object level and use informationExpand
  • 31
  • 5
High-frequency simulations of global seismic wave propagation using SPECFEM3D_GLOBE on 62K processors
SPECFEM3D_GLOBE is a spectral-element application enabling the simulation of global seismic wave propagation in 3D anelastic, anisotropic, rotating and self-gravitating Earth models at unprecedentedExpand
  • 25
  • 4
High-frequency simulations of global seismic wave propagation using SPECFEM3D_GLOBE on 62K processors
SPECFEM3D_GLOBE is a spectral element application enabling the simulation of global seismic wave propagation in 3D anelastic, anisotropic, rotating and self-gravitating Earth models at unprecedentedExpand
  • 50
  • 3
An idiom-finding tool for increasing productivity of accelerators
Suppose one is considering purchase of a computer equipped with accelerators. Or suppose one has access to such a computer and is considering porting code to take advantage of the accelerators. IsExpand
  • 33
  • 3
A genetic algorithms approach to modeling the performance of memory-bound computations
Benchmarks that measure memory bandwidth, such as STREAM, Apex-MAPS and MultiMAPS, are increasingly popular due to the "Von Neumann" bottleneck of modern processors which causes many calculations toExpand
  • 75
  • 2
DARPA's HPCS Program- History, Models, Tools, Languages
Abstract The historical context with regard to the origin of the DARPA High Productivity Computing Systems (HPCS) program is important for understanding why federal government agencies launched thisExpand
  • 40
  • 2
Hardware monitors for dynamic page migration
In this paper, we first introduce a profile-driven online page migration scheme and investigate its impact on the performance of multithreaded applications. We use centralized lightweight,Expand
  • 45
  • 2