Corpus ID: 10597000

Network and Data Location Aware Job Scheduling in Grid : Improvement to GridWay Metascheduler

@inproceedings{Kumar2012NetworkAD,
  title={Network and Data Location Aware Job Scheduling in Grid : Improvement to GridWay Metascheduler},
  author={Saumesh Kumar and Naveen Kumar},
  year={2012}
}
Grid Computing has enabled us to utilize the unused computing power (CPU cycles) of computers connected to networks (e.g. Internet). Nowadays, there are lots of scientific projects going on in the domain of High Energy Physics (HEP) and Grid infrastructure constitutes the core computing facility of these projects. One such project is LHC (Large Hadron Collider) deployed at CERN. These experiments produce and manage a large amount of data per day and run thousands of computing jobs to process… Expand

Figures and Tables from this paper

A New Job Scheduling in Data Grid Environment Based on Data and Computational Resource Availability
TLDR
A new Integrated Job Scheduling Strategy (IJSS) is proposed, which considers both data and computational resource availability of the network, and on the other hand, considering the corresponding requirements of each job, it determines a value called W to the job. Expand
Improve the Performance of Data Grids by Cost-Based Job Scheduling Strategy
TLDR
A new Cost-Based Job Scheduling (CJS) strategy is proposed, which considers both data and computational resource availability of the network and the corresponding requirements of each job, and determines a value called W to the job. Expand
A Hybrid Approach for Scheduling based on Multi-criteria Decision Method in Data Grid
TLDR
A new Hybrid Job Scheduling (HJS) strategy is proposed, which considers both data and computational resource availability of the network, and on the other hand, considering the corresponding requirements of each job, it determines a value called W to the job. Expand
A data-aware cognitive engine for scheduling data intensive applications in a grid
TLDR
A novel mechanism of incorporating cognitive science in a data grid is proposed for scheduling data-intensive workflows in which a cognitive engine (CE) is built into the middleware of the data grid. Expand
Data-Aware Task Allocation for Achieving Low Latency in Collaborative Edge Computing
TLDR
This paper proposes a multistage greedy adjustment (MSGA) algorithm where the task scheduling is done by considering both placement of tasks and adjustment of network flows and shows that MSGA leads to up to 27% improvement in completion time as compared to benchmark solutions. Expand
Seamlessly Enabling the Use of Cloud Resources in Workflows
The hosting of large on-premise computational resources is common practice. Cloud Computing offers a promising, alternative infrastructure for using scalable ondemand off-premise resources. However,Expand

References

SHOWING 1-10 OF 20 REFERENCES
Data Intensive and Network Aware (DIANA) Grid Scheduling
TLDR
A Data Intensive and Network Aware (DIANA) meta-scheduling approach, which takes into account data, processing power and network characteristics when making scheduling decisions across multiple sites is described. Expand
Studying the Influence of Network-Aware Grid Scheduling on the Performance Received by Users
TLDR
Results presented here show that the response time perceived by Grid users is reduced when data on network performance are considered in the job scheduling process, and the first step in order to proceed with more complicated and efficient scheduling and reservation processes. Expand
Improving GridWay with network information: Tuning the monitoring tool
TLDR
A tuning of the network monitoring tool is presented and evaluated, showing that the overhead caused by this monitoring tool varies depending on the monitoring parameters, and these ones, in turn, depend on the actual Grid environment. Expand
Simulation Studies of Computation and Data Scheduling Algorithms for Data Grids
TLDR
This work develops a suite of job scheduling and data replication algorithms that are evaluated using simulations for a wide range of parameters and finds that scheduling jobs to locations that contain the data they need and asynchronously replicating popular data-sets to remote sites works rather well. Expand
Economic-based Distributed Resource Management and Scheduling for Grid Computing
TLDR
This thesis proposes a distributed computational economy as an effective metaphor for the management of resources and application scheduling and proposes an architectural framework that supports resource trading and quality of services based scheduling that enables the regulation of supply and demand for resources. Expand
Decoupling computation and data scheduling in distributed data-intensive applications
  • K. Ranganathan, I. Foster
  • Computer Science
  • Proceedings 11th IEEE International Symposium on High Performance Distributed Computing
  • 2002
TLDR
This work develops a family of algorithms and uses simulation studies to evaluate various combinations of these algorithms to suggest that while it is necessary to consider the impact of replication, it is not always necessary to couple data movement and computation scheduling. Expand
Nimrod/G: an architecture for a resource management and scheduling system in a global computational grid
  • R. Buyya, D. Abramson, J. Giddy
  • Computer Science
  • Proceedings Fourth International Conference/Exhibition on High Performance Computing in the Asia-Pacific Region
  • 2000
TLDR
The proposed Nimrod/G grid-enabled resource management and scheduling system builds on the earlier work on Nimrod and follows a modular and component-based architecture enabling extensibility, portability, ease of development, and interoperability of independently developed components. Expand
A Flexible Job Scheduling System for Heterogeneous Grids
TLDR
Experiments are carried out and proven that FISS proposed in the paper can improve the efficiency and utilization of grid system while satisfying users' QoS requirements and achieving the interoperability among heterogeneous grids. Expand
Evaluating scheduling and replica optimisation strategies in OptorSim
TLDR
The results show that the choice of scheduling and data replication strategies can have a large effect on both job throughput and the overall consumption of grid resources. Expand
Genetic algorithm based schedulers for grid computing systems
TLDR
The extensive experimental study showed that the GA-based schedulers outperform existing GA implementations in the literature for the problem and also revealed their efficiency when makespan and flowtime are minimized either in a hierarchical or a simultaneous optimization mode. Expand
...
1
2
...