• Publications
An Overview of the BlueGene/L Supercomputer
This paper gives an overview of the BlueGene/L Supercomputer. This is a jointly funded research partnership between IBM and the Lawrence Livermore National Laboratory as part of the United States ...
  • 559
  • 21
Work-first and help-first scheduling policies for async-finish task parallelism
TLDR
We address the problem of efficient and scalable implementation of X10's async-finish task parallelism, which is more general than Cilk's spawn-sync parallelism and can easily be integrated into any of the programming models listed above.
  • 163
  • 8
May-happen-in-parallel analysis of X10 programs
TLDR
We introduce a new algorithm for May-Happen-in-Parallel (MHP) analysis of X10 programs.
  • 96
  • 8
Efficient Computation of May-Happen-in-Parallel Information for Concurrent Java Programs
  • R. Barik
  • Computer Science
  • LCPC
  • 20 October 2005
TLDR
We introduce a thread model that is both context- and flow-sensitive to efficiently compute May-Happen-in-Parallel information for the analysis of multi-threaded programs.
  • 51
  • 6
Extended Linear Scan: An Alternate Foundation for Global Register Allocation
TLDR
In this paper, we extend past work on Linear Scan register allocation, and propose two Extended Linear Scan (ELS) algorithms that retain the compile-time efficiency of past Linear Scan algorithms while delivering performance that can match or surpass that of Graph Coloring.
  • 42
  • 6
Adaptive heterogeneous scheduling for integrated GPUs
TLDR
We present novel scheduling techniques for integrated CPU-GPU processors that leverage online profiling to automatically partition the work of data-parallel kernels between the CPU and GPU.
  • 81
  • 5
The Habanero multicore software research project
TLDR
We propose a new approach to multicore software enablement based on a two-level programming model consisting of a higher-level coordination language for domain experts and a lower-level parallel language for programming experts.
  • 80
  • 5
Efficient Selection of Vector Instructions Using Dynamic Programming
TLDR
We present an auto-vectorization framework in the back-end of a dynamic compiler that not only generates optimized vector code but is also well integrated with the instruction scheduler and register allocator.
  • 43
  • 5
Deadlock-free scheduling of X10 computations with bounded resources
TLDR
In this paper, we address the problem of guaranteeing the absence of physical deadlock in the execution of a parallel program that uses the async, finish, atomic, and place constructs from the X10 language, using the Active Message network and the new concept of Doppelgänger mode execution of X10.
  • 54
  • 4
iACT: A Software-Hardware Framework for Understanding the Scope of Approximate Computing
TLDR
This paper discusses an open-source toolkit called iACT (Intel's Approximate Computing Toolkit) for analyzing and studying the scope of approximations in applications.
  • 58
  • 4