• Publications
  • Influence
AA-Sort: A New Parallel Sorting Algorithm for Multi-Core SIMD Processors
TLDR
In this paper, we propose a new parallel sorting algorithm, called aligned-access sort (AA-sort), for shared-memory multi processors. Expand
  • 127
  • 15
  • PDF
Overview of the IBM Java Just-in-Time Compiler
TLDR
We present the design and implementation of several optimizations and techniques included in the latest IBM JavaTM Just-in-Time (JIT) Compiler. Expand
  • 233
  • 12
  • PDF
A dynamic optimization framework for a Java just-in-time compiler
TLDR
The high performance implementation of Java Virtual Machines (JVM) and just-in-time (JIT) compilers is directed toward adaptive compilation optimizations on the basis of online runtime profile information. Expand
  • 136
  • 8
  • PDF
Design and evaluation of dynamic optimizations for a Java just-in-time compiler
TLDR
The high performance implementation of Java Virtual Machines (JVM) and Just-In-Time (JIT) compilers is directed toward employing a dynamic compilation system on the basis of online runtime profile information. Expand
  • 53
  • 7
A study of devirtualization techniques for a Java Just-In-Time compiler
TLDR
We propose a new technique called direct devirtualization with the code patching mechanism to reduce the runtime overhead of dynamic method calls for Java in a straightforward manner. Expand
  • 128
  • 6
  • PDF
Design, implementation, and evaluation of optimizations in a just-in-time compiler
TLDR
The Java language incurs a runtime overhead for exception checks and object accesses without an interior pointer in order to ensure safety. Expand
  • 90
  • 4
  • PDF
Detection and global optimization of reduction operations for distributed parallel machines
TLDR
This paper presents a new technique for detecting and optimizing reduction operations for parallelizhtg compilers. Expand
  • 40
  • 3
Cloneable JVM: a new approach to start isolated java applications faster
TLDR
We propose a new isolation approach called Cloneable JVM to eliminate this startup overhead in Java. Expand
  • 33
  • 3
  • PDF
MPI microtask for programming the Cell Broadband EngineTM processor
TLDR
We propose a new programming model, MPI microtask, based on the standard Message Passing Interface (MPI) programming model for distributed-memory parallel machines. Expand
  • 82
  • 2
REAL-TIME MUTUAL-INFORMATION-BASED LINEAR REGISTRATION ON THE CELL BROADBAND ENGINE PROCESSOR
TLDR
We have implemented a mutual-information-based 3D linear registration algorithm on the Cell Broadband Enginetrade (CBE) processor, which has nine processor cores on a chip and has a 4-way SIMD unit for each core. Expand
  • 34
  • 2
  • PDF
...
1
2
3
4
5
...