Simultaneous multithreading architectures have been defined previously with fully shared execution resources. When one thread in such an architecture experiences a very long-latency operation, such as a load miss, the thread will eventually stall, potentially holding resources which other threads could be using to make forward progress.This paper shows that… (More)
This paper describes initial results for an architecture called the Shared-Thread Multiprocessor (STMP). The STMP combines features of a multithreaded processor and a chip multiprocessor; specifically, it enables distinct cores on a chip multiprocessor to share thread state. This shared thread state allows the system to schedule threads from a shared pool… (More)
As the number of cores increases on chip multiprocessors, coherence is fast becoming a central issue for multi-core performance. This is exacerbated by the fact that interconnection speeds are not scaling well with technology. This paper describes mechanisms to accelerate coherence for a multi-core architecture that has multiple private L2 caches and a… (More)
Previous work on speculative precomputation (SP) on simultaneous multithreaded (SMT) architectures has shown significant benefits. The SP techniques improve single-threaded program performance by utilizing otherwise idle thread contexts to run " helper threads " , which prefetch critical data into shared caches and reduce the time the " main thread " stalls… (More)
— In passive and active measurement projects it is easy to generate large volumes of raw data. Although this allows for detailed analysis, the large volume is an impediment to understanding the data and locating interesting events. One approach to this problem is to use visualization tools to produce a graphical rendering of the data that allows a user to… (More)
The most significant source of lost performance when a thread migrates between cores is the loss of cache state. A significant boost in post-migration performance is possible if the cache working set can be moved, proactively, with the thread. This work accelerates thread startup performance after migration by predicting and prefetching the working set of… (More)
This paper presents some guidelines which may be used to evaluate extensions to APL. These guidelines are then applied to some proposed APL extensions including complex arithmetic, functions on non-simple arrays and operators.
Perchlorate is a known environmental contaminant, largely due to widespread military use as a propellant. Perchlorate acts pharmacologically as a competitive inhibitor of thyroidal iodide uptake in mammals, but the impacts of perchlorate contamination in aquatic ecosystems and, in particular, the effects on fish are unclear. Our studies aimed to investigate… (More)