Erik Lindholm

Modern graphics processing units (GPUs) use a large number of hardware threads to hide both function unit and memory access latency. Extreme multithreading requires a complicated thread scheduler as well as a large register file, which is expensive to access both in terms of energy and latency. We present two complementary techniques for reducing energy on…
Modern graphics processing units (GPUs) employ a large number of hardware threads to hide both function unit and memory access latency. Extreme multithreading requires a complex thread scheduler as well as a large register file, which is expensive to access both in terms of energy and latency. We present two complementary techniques for reducing energy on…
This article consists of a collection of slides from the author's conference presentation on NVIDIA's GeForce 8800 GPU. Topics discussed include an overview of the GeForce 8800 architecture; the streaming processor array and its processing capabilities; the Raster Operation Pipeline; and GeForce implementation, deployment, and performance…