Dong-yuan Chen

Learn More
Traditional software controlled data cache prefetching isoften ineffective due to the lack of runtime cache miss andmiss address information. To overcome this limitation, weimplement runtime data cache prefetching in the dynamicoptimization system ADORE (ADaptive Object code RE-optimization).Its performance has been compared withstatic software prefetching(More)
Dynamic compilers (or Just-in-Time [JIT] compilers) are a key component of managed runtime environments. This paper describes the design and implementation of the StarJIT compiler, a dynamic compiler for Java Virtual Machines and Common Language Runtime platforms. The goal of the StarJIT compiler is to build an infrastructure to research the influence of(More)
Optimizing programs at run-time provides opportunities to apply aggressive optimizations to programs based on information that was not available at compile time. At run time, programs can be adapted to better exploit architectural features, optimize the use of dynamic libraries, and simplify code based on run-time constants.Our profiling system provides a(More)
In this paper, we evaluate the emerging Transactional Memory (TM) area by developing a set of Java transactional memory workloads and studying their performance under a Java Software Transactional Memory (STM) system and comparing them to their lock based counterparts. We provide a detailed performance and memory consumption analysis of the overheads of(More)
Planar cell polarity (PCP) information is a critical determinant of organ morphogenesis. While PCP in bounded epithelial sheets is increasingly well understood, how PCP is organized in tubular and acinar tissues is not. Drosophila egg chambers (follicles) are an acinus-like "edgeless epithelium" and exhibit a continuous, circumferential PCP that does not(More)
Smaller input data sets such as the test and the train input sets are commonly used in simulation to estimate the impact of architecture/micro-architecture features on the performance of SPEC benchmarks. They are also used for profile feedback compiler optimizations. In this paper, we examine the reliability of reduced input sets for performance simulation(More)
Protein tyrosine phosphatases (PTPs) are a group of tightly regulated enzymes that coordinate with protein tyrosine kinases to control protein phosphorylation during various cellular processes. Using genetic analysis in Drosophila non-transmembrane PTPs, we identified one role that Myopic (Mop), the Drosophila homolog of the human His domain phosphotyrosine(More)
<i>Computation-reuse and value-prediction are two recent techniques for improving microprocessor performance by exploiting value localities. They both aim at breaking the data dependence limit in traditional processors. In this paper, we propose a speculative multithreading scheme in which the same hardware can be efficiently used for both computation reuse(More)
To date our understanding of Drosophila receptor protein tyrosine phosphatases (R-PTPs) in the regulation of signal transduction is limited. Of the seven R-PTPs identified in flies, six are involved in the axon guidance that occurs during embryogenesis. However, whether and how R-PTPs may control key steps of Drosophila development is not clear. In this(More)
Optimization of a real world application BLAST is used to demonstrate the limitations of static and profile-guided optimizations and to highlight the potential of runtime optimization systems. We analyze the performance profile of this application to determine performance bottlenecks and evaluate the effect of aggressive compiler optimizations on BLAST. We(More)