- Full text PDF available (3)
Prefetching reduces cache miss latency by moving data up in memory hierarchy before they are actually needed. Recent hardware-based stride prefetching techniques mostly rely on the processor pipeline information (e.g. program counter and branch prediction table) for prediction. Continuing developments in processor microarchi-tecture drastically change core… (More)
The available Instruction Level Parallelism in Java bytecode (Java-ILP) is not readily exploitable using traditional in-order or out-of-order issue mechanisms due to dependencies involving stack operands. The sequentialization due to stack dependency can be overcome by identifying bytecode traces, which are sequences of bytecode instructions that when… (More)
Manuscript not released forpublication due to legal issues.
High performance on Java applications running on server and desktop machines requires fast execution of Java bytecodes. Such performance can be achieved by Just-In-Time (JIT) compilers, which translate the stack-based bytecodes into register-based machine code on demand. But one crucial problem in Java JIT compilation is the compilation time, which… (More)
The continued scaling of devices in accordance with Moore's law requires activation of some implants such as the source-drain extensions, SDEs, with as little diffusion as possible. New options in thermal processing are described and compared. Thermal flux annealing is the regime where power density is high enough to cause local heating but not so high as… (More)