The Importance of Complete Data Sets for Job Scheduling Simulations

Dalibor Klusáček and Hana Rudová
This paper was inspired by a study of the complex data set from the Czech National Grid MetaCentrum. Unlike other widely used workloads from the Parallel Workloads Archive or the Grid Workloads Archive, this data set includes additional information on machine failures, job requirements, and machine parameters, which allows more realistic simulations to be performed. We show that large differences in the performance of various scheduling algorithms appear when this additional information is…
