• Corpus ID: 62265802

ECOFIT: A Framework to Estimate Energy Consumption of Fault Tolerance protocols during HPC executions

@inproceedings{Diouri2013ECOFITAF,
  title={ECOFIT: A Framework to Estimate Energy Consumption of Fault Tolerance protocols during HPC executions},
  author={Mohammed el Mehdi Diouri and Olivier Gl{\"u}ck and Laurent Lef{\`e}vre and Franck Cappello},
  booktitle={IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing},
  year={2013},
  url={https://api.semanticscholar.org/CorpusID:62265802}
}
An energy estimation framework that relies on an energy calibration of the considered platform and a user description of the execution setting to allow selecting the best fault tolerant protocol without pre-executing the application.

Energy-Aware Checkpointing Strategies

This chapter describes in this chapter a methodology to estimate the energy consumption of the fault-tolerant protocols used for HPC applications and evaluates the accuracy of the estimations with applications and scenarios executed on a real platform with energy consumption monitoring.

Understanding Practical Tradeoffs in HPC Checkpoint-Scheduling Policies

This work provides an extensive analysis of the performance, energy and I/O costs associated with a wide array of checkpointing policies and proposes methods to optimize checkpoint scheduling for energy savings and evaluates the runtime-optimized and energy- Optimized policies using simulations based on failure logs from 10 production HPC clusters.

To checkpoint or not to checkpoint: Understanding energy-performance-I/O tradeoffs in HPC checkpointing

This paper provides an extensive analysis of the energy/ performance tradeoffs associated with an array of checkpoint scheduling policies, including policies that are proposed, as well as few existing ones in the literature.

Language-Based Expression of Reliability and Parallelism for Low-Power Computing

The results obtained indicate that it is feasible to write programs that remain adaptable after compilation in order to improve the ability to balance reliability, power, and performance.

Exploiting Field Data Analysis to Improve the Reliability and Energy-efficiency of HPC Systems

The efficient design and operation of such large-scale installations critically relies on developing an in-depth understanding of their failure behaviour as well as their energy consumption.

Providing Green Services in HPC Data Centers: A Methodology Based on Energy Estimation

Reducing the energy consumption of high-performance computing infrastructures is a major challenge for the next years in order to be able to move to the exascale era.

SESAMES: A Smart-Grid Based Framework for Consuming Less and Better in Extreme-Scale Infrastructures

SESAMES is a smart and energy-aware service-oriented architecture manager that proposes energy-efficient services for exascale applications and provides an optimized reservation scheduling and the new features of this framework which are the design of a smart grid and a multi-criteria green job scheduler.

Energy estimation for MPI broadcasting algorithms in large scale HPC systems

A framework to estimate the energy consumed by different MPI broadcasting algorithms for various execution settings is proposed and results show that these estimations are highly accurate and allow to select the least consuming broadcasting algorithm.

Contributions à la flexibilité et à l'efficacité énergétique des systèmes distribués à grande échelle

Les systemes distribues a grande echelle (Datacenters, Grilles, Clouds, Reseaux) sont des acteurs incontournables dans notre societe de communication et d'echanges electroniques. Ces infrastructures

Efficacité énergétique dans le calcul très haute performance : application à la tolérance aux pannes et à la diffusion de données. (Energy efficiency in very high-performance computing : application to fault tolerance and data broadcasting)

    M. Diouri
    Political Science, Computer Science
  • 2013
L'ordonnanceur multi-criteres des reservations de ressources propose dans SESAMES permet de reduire a la fois the consommation energetique, le cout financier and l'impact environnemental of ces reservations, tout en respectant les contraintes imposees par l'utilisateur and le fournisseur d'energie.