Combined Fault Tolerance and Scheduling Techniques for Workflow Applications on Computational Grids

@article{Zhang2009CombinedFT,
  title={Combined Fault Tolerance and Scheduling Techniques for Workflow Applications on Computational Grids},
  author={Yonghui Zhang and Anirban Mandal and Charles Koelbel and Keith D. Cooper},
  journal={2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid},
  year={2009},
  pages={244--251}
}
Complex scientific workflows are now increasingly executed on computational grids. In addition to the challenges of managing and scheduling these workflows, reliability challenges arise because of the unreliable nature of large-scale grid infrastructure. Fault tolerance mechanisms like over-provisioning and checkpoint-recovery are used in current grid application management systems to address these reliability challenges. In this work, we propose new approaches that combine these fault…
