Learn More
Simultaneous multithreading architectures have been defined previously with fully shared execution resources. When one thread in such an architecture experiences a very long-latency operation, such as a load miss, the thread will eventually stall, potentially holding resources which other threads could be using to make forward progress.This paper shows that(More)
As the number of cores increases on chip multiprocessors, coherence is fast becoming a central issue for multi-core performance. This is exacerbated by the fact that interconnection speeds are not scaling well with technology. This paper describes mechanisms to accelerate coherence for a multi-core architecture that has multiple private L2 caches and a(More)
This paper describes initial results for an architecture called the Shared-Thread Multiprocessor (STMP). The STMP combines features of a multithreaded processor and a chip multiprocessor; specifically, it enables distinct cores on a chip multiprocessor to share thread state. This shared thread state allows the system to schedule threads from a shared pool(More)
Previous work on speculative precomputation (SP) on simultaneous multithreaded (SMT) architectures has shown significant benefits. The SP techniques improve single-threaded program performance by utilizing otherwise idle thread contexts to run " helper threads " , which prefetch critical data into shared caches and reduce the time the " main thread " stalls(More)
Kainic acid is known to stimulate the release of glutamate (GLU) and aspartate (ASP) from presynaptic neurons. It has been suggested that the enhanced release of these endogenous EAA's plays a significant role in the excitotoxic effects of KA. Domoic acid (DOM), a shellfish toxin, is structurally similar to KA, and has been shown to be 3-8 times more toxic(More)
— In passive and active measurement projects it is easy to generate large volumes of raw data. Although this allows for detailed analysis, the large volume is an impediment to understanding the data and locating interesting events. One approach to this problem is to use visualization tools to produce a graphical rendering of the data that allows a user to(More)
The most significant source of lost performance when a thread migrates between cores is the loss of cache state. A significant boost in post-migration performance is possible if the cache working set can be moved, proactively, with the thread. This work accelerates thread startup performance after migration by predicting and prefetching the working set of(More)
  • 1