VirtCFT: A Transparent VM-Level Fault-Tolerant System for Virtual Clusters

@inproceedings{Zhang2010VirtCFTAT,
  title={VirtCFT: A Transparent VM-Level Fault-Tolerant System for Virtual Clusters},
  author={Minjia Zhang and Hai Jin and Xuanhua Shi and Song Wu},
  booktitle={2010 IEEE 16th International Conference on Parallel and Distributed Systems},
  year={2010},
  pages={147--154}
}
A virtual cluster consists of many virtual machines and software components, any of which will eventually fail. In many environments, such failures can cause unanticipated, potentially devastating behavior and service unavailability. Failover capability is essential to a virtual cluster’s availability, reliability, and manageability. Most existing methods share several disadvantages: they require modifications to the target processes or their OSes, which…
This paper has 21 citations.

Citations

Publications citing this paper.
Adaptive time-based coordinated checkpointing for cloud computing workflows

Scalable Computing: Practice and Experience • 2014

On Cloud Service Reliability Enhancement with Optimal Resource Usage

IEEE Transactions on Cloud Computing • 2016

Reliability enhancement for cloud services - a survey

2016 International Conference on Computer Communication and Informatics (ICCCI) • 2016

References

Publications referenced by this paper.
Kemari: virtual machine synchronization for fault tolerance

Y. Tamura, K. Sato, S. Kihara, S. Moriai
Proc. USENIX'08 Poster Session • 2008

Computing in the clouds

A. Weiss
netWorker, pp. 16-25 • 2007

Software failures and the road to a petaflop machine

I. Philp
Proc. of the 1st Workshop on High Performance Computing Reliability Issues • 2005

Architecture of LA-MPI, a network-fault-tolerant MPI

18th International Parallel and Distributed Processing Symposium, 2004. Proceedings. • 2004
