Learn More
Minimizing latency and maximizing throughput are important goals in the design of routing algorithms for interconnection networks. Ideally, we would like a routing algorithm to (a) route packets using the minimal number of hops to reduce latency and preserve communication locality, (b) deliver good worst-case and average-case throughput and (c) enable(More)
Streaming is emerging as an important programming model for multicores. Streaming provides an elegant way to express task decomposition and inter-task communication, while hiding laborious orchestration details such as load balancing, assignment (of stream computation to nodes) and computation/communication scheduling from the programmer. This paper(More)
Crossbar arbitration - which determines the allocation of output ports to packets in the input queues - is a performance-critical stage in the overall performance of routers for input-queued networks. The overall performance of crossbar arbitration depends on two metrics: (a) matching power - the ability of the arbiter to maximize the number of matches(More)
This is to certify that the thesis prepared By Entitled Complies with University regulations and meets the standards of the Graduate School for originality and quality For the degree of Final examining committee members , Chair ii To Boo, who knows how to relax. To my family, thank you all for your support. iii ACKNOWLEDGMENTS This project has undergone(More)
Researchers have proposed clustered microarchitectures for performance and energy effciency. Typically, clustered microarchitectures offer fast, local bypassing between instructions within clusters but global bypasses are slower. Traditional clustered microarchitectures (TCM) are implemented by partitioning the register file and associated functional units(More)
  • 1