• Corpus ID: 211215

FAWNdamentally Power-efficient Clusters

@inproceedings{Vasudevan2009FAWNdamentallyPC,
  title={FAWNdamentally Power-efficient Clusters},
  author={Vijay Vasudevan and Jason Franklin and David G. Andersen and Amar Phanishayee and Lawrence Tan and Michael Kaminsky and Iulian Moraru},
  booktitle={HotOS},
  year={2009}
}
Power is becoming an increasingly large financial and scaling burden for computing and society. The costs of running large data centers are becoming dominated by power and cooling to the degree that companies such as Microsoft and Google have built new data centers close to large and cost-efficient hydroelectric power sources [8]. Studies have projected that by 2012, 3-year data center energy costs will be double that of server equipment expenditures [15]. Power consumption and related cooling… 

Figures and Tables from this paper

Low-power amdahl-balanced blades for data intensive computing
TLDR
This paper proposes an alternative architecture comprising large number of so-called Amdahl blades that combine energy-efficient CPUs with solid state disks to increase sequential read I/O throughput by an order of magnitude while keeping power consumption constant.
FAWN-DLi : A Data Library for a Fast Array of Wimpy Nodes
TLDR
This project extends one existing lowpower system, FAWN-KV, to support more sophisticated sever-side data operations and improves performance by up to a factor of three and outperformed the basic system in all tests.
Web Search Using Small Cores: Quantifying the Price of Efficiency
TLDR
This work evaluates search on server and mobile-class architectures using Xeon and Atom processors, quantifying search efficiency at the microarchitecture and system-level and prompts us toward re-thinking small core designs for a new breed of data center workloads in order to continue reaping the benefits of small-core power efficiency.
Web search using mobile cores: quantifying and mitigating the price of efficiency
TLDR
This work quantifies efficiency for an industry-strength online web search engine in production at both the microarchitecture- and system-level, evaluating search on server and mobile-class architectures using Xeon and Atom processors.
Towards Energy-Efficient Database Cluster Design
TLDR
The cluster design space is explored using empirical results and a model that considers the key bottlenecks to energy efficiency in a parallel DBMS is proposed, representing a key first step in designing energy-efficient database clusters.
REDUCING CLUSTER POWER CONSUMPTION BY DYNAMICALLY SUSPENDING IDLE NODES
TLDR
It is argued that using more realistic workloads would have yielded much better throughput with slightly reduced power consumption, and that the percentage power savings in a larger, more realistically sized cluster would be higher than shown in the results.
Simulation of power consumption of energy efficient cluster hardware
TLDR
A model to estimate power consumption of hardware based on the utilization, which provides an upper bound for any power saving strategy and the minimum power consumption can be determined for a given application.
Models and Control Strategies for Data Center Energy Efficiency
TLDR
This dissertation considers data centers as CPSs, with a focus on run-time management and operating costs and the proposed modeling framework explicitly captures the cyberphysical nature of data centers and allows the development of models that represent both the computational and the thermal characteristics of a data center, as well as their interactions.
A Massively Parallel, Energy Efficient Programmable Accelerator for Learning and Classification
TLDR
The MAPLE architecture is described, its design space is explored with a simulator, how to automatically map application kernels to the hardware is illustrated, and its performance improvement and energy benefits over classic server-based implementations are presented.
An Energy-Efficient Heterogeneous System for Embedded Learning and Classification
TLDR
This letter builds a low-power system using an Atom processor, an ION, a GPU, and a field-programmable gate array (FPGA)-based custom accelerator, and study its performance and power characteristics using four representative workloads.
...
1
2
3
...

References

SHOWING 1-10 OF 23 REFERENCES
Active Disks for Large-Scale Data Processing
TLDR
This work proposes using an active disk storage device that combines on-drive processing and memory with software downloadability to allow disks to execute application-level functions directly at the device.
Delivering Energy Proportionality with Non Energy-Proportional Systems - Optimizing the Ensemble
TLDR
This paper demonstrates how optimization-based techniques can be used to build systems with off-the-shelf hardware that, when viewed at the aggregate level, approximate the behavior of energy-proportional systems.
JouleSort: a balanced energy-efficiency benchmark
TLDR
This work proposes and motivate JouleSort, an external sort benchmark, for evaluating the energy efficiency of a wide range of computer systems from clusters to handhelds, and demonstrates a Joule sort system that is over 3.5x as energy-efficient as last year's estimated winner.
On the Effects of Memory Latency and Bandwidth on Supercomputer Application Performance
  • R. Murphy
  • Computer Science
    2007 IEEE 10th International Symposium on Workload Characterization
  • 2007
TLDR
This paper compares the memory performance sensitivity of both traditional and emerging HPC applications, and shows that the new codes are significantly more sensitive to memory latency and bandwidth than their traditional counterparts.
Trade-Offs Between Voltage Scaling and Processor Shutdown for Low-Energy Embedded Multiprocessors
TLDR
Scheduling heuristics are presented that determine the best trade-off between DVS, processor shutdown, and finding the optimal number of processors, and it is shown that the best approach leaves little room for improvement.
A case for redundant arrays of inexpensive disks (RAID)
TLDR
Five levels of RAIDs are introduced, giving their relative cost/performance, and a comparison to an IBM 3380 and a Fujitsu Super Eagle is compared.
Overview of the Blue Gene/L system architecture
TLDR
The key architectural features of BlueGene/L are introduced: the link chip component and five Blue Gene/L networks, the PowerPC® 440 core and floating-point enhancements, the on-chip and off-chip distributed memory system, the node- and system-level design for high reliability, and the comprehensive approach to fault isolation.
Dynamo: amazon's highly available key-value store
TLDR
D Dynamo is presented, a highly available key-value storage system that some of Amazon's core services use to provide an "always-on" experience and makes extensive use of object versioning and application-assisted conflict resolution in a manner that provides a novel interface for developers to use.
Evaluation of Existing Architectures in IRAM Systems
TLDR
This work examined both execution time analyses of existing microprocessors and system simulation of hypothetical processors to determine whether existing microarchitectures can tap the potential performance advantages of IRAM systems.
Diamond: A Storage Architecture for Early Discard in Interactive Search
TLDR
An informal user study of an image retrieval application supports the belief that early discard significantly improves the quality of interactive searches.
...
1
2
3
...