Total views : 242

Intra State Recovery System Design for Cloud based Applications


  • School of Engineering and Technology, Ansal University, Gurgaon - 122003, Haryana, India


Background/Objectives: Considering the growing demand for cloud services for development and deploying of critical business applications, it is extremely important that cloud provider guarantees a reliable and robust service by providing fault tolerance mechanisms that enable seamless execution of the business transaction execution even in presence of faulty components. The objective of this paper is to propose a collaborative fault tolerant mechanism between cloud provider and cloud client. Methods/Statistical Analysis: The collaborative fault tolerance approach considers collaboration between the cloud provider and the cloud client to develop a comprehensive fault tolerance solution that can be customised to suit to the hosted cloud applications needs.The proposed design is based on usage of Persistent Map based strategy. Findings: The Persistent Map based strategy saves the state information of execution in the form of P-maps. The P-map is a persistent hash map that stores the current state of execution of a given task. In the case of failure, it can be used to restart the process from the last state at which the task failed and resume the application execution from that point as though no failure occurred.The P-map storage is a crucial element to be considered in the design of the system, that requires careful analysis and can have a huge impact on the execution of an application. Application/Improvements: The authors have considered an approach which requires a collaboration between cloud providers and cloud client to design a fault tolerance mechanism that takes into consideration the complex cloud infrastructure as well behaviour and functionality of the application in focus.


Cloud Computing, Fault Tolerance, Persistent Maps, Recovery System.

Full Text:

 |  (PDF views: 250)


  • Mell P, Grance T. The NIST definition of cloud computing.
  • Mahajan K, Makroo A, Dahiya D. Round robin with server affinity: A VM load balancing algorithm for cloud based infrastructure. Journal of information processing systems. 2013; 9(3):379–94.
  • Armbrust M, Fox A, Griffith R, Joseph AD, Katz R, Konwinski A, Lee G, Patterson D, Rabkin A, Stoica I, Zaharia M. A view of cloud computing. Communications of the ACM. 2010 Apr 1; 53(4):50–8.
  • Shen Z, Subbiah S, Gu X, Wilkes J. Cloudscale: Elastic resource scaling for multi-tenant cloud systems. Proceedings of the 2nd ACM Symposium on Cloud Computing; 2011 Oct 26. p. 5.
  • Zhang Q, Cheng L, Boutaba R. Cloud computing: State-oftheart and research challenges. Journal of Internet Services and Applications. 2010 May 1; 1(1):7–18.
  • Zhao W, Melliar-Smith PM, Moser LE. Fault tolerance middleware for cloud computing. 2010 IEEE 3rd International Conference on Cloud Computing (CLOUD); 2010 Jul 5. p. 67–74.
  • Birman KP, Van Rennesse R. Reliable distributed computing using the ISIS toolkit. Los Alamitos, CA. CA: IEEE Computer Society; 1994.
  • Jhawar R, Piuri V, Santambrogio M. Fault tolerance management in cloud computing: A system-level perspective. IEEE Systems Journal. 2013 Jun; 7(2):288–97.
  • Dikaiakos MD, Katsaros D, Mehra P, Pallis G, Vakali A. Cloud computing: Distributed internet computing for IT and scientific research. IEEE Internet Computing. 2009 Sep; 13(5):10–3.
  • Inoue T, Umeno H, Tanaka S, Yamamoto T, Ohtsuki T. System for recovery from a virtual machine monitor failure with a continuous guest dispatched to a nonguest mode. United States patent US 5,437,033; 1995 Jul 25.
  • Oldfield RA. Investigating lightweight storage and overlay networks for fault tolerance. Proceedings of the High Availability and Performance Computing Workshop; Santa Fe, NM. 2006 Oct.
  • Vallee G, Engelmann C, Tikotekar A, Naughton T, Charoenpornwattana K, Leangsuksun C, Scott SL. A framework for proactive fault tolerance. IEEE Third International Conference on Availability, Reliability and Security, 2008, ARES 08; 2008 Mar 4. p. 659–64.
  • Bala A, Chana I. Fault tolerance-challenges, techniques and implementation in cloud computing. IJCSI. 2012 Jan; 9(1):288–93.
  • Hwang S, Kesselman C. A flexible framework for fault tolerance in the grid. Journal of Grid Computing. 2003 Sep 1; 1(3):251–72.
  • Juhnke E, Dornemann T, Freisleben B. Fault-tolerant BPEL workflow execution via cloud-aware recovery policies. IEEE 35th Euromicro Conference on Software Engineering and Advanced Applications, 2009, SEAA’09; 2009 Aug 27. p. 31–8.
  • Dai Y, Xiang Y, Zhang G. Self-healing and hybrid diagnosis in cloud computing. Cloud Computing. Springer Berlin Heidelberg; 2009 Dec 1. p. 45–56.
  • Krutz RL, Vines RD. Cloud security: A comprehensive guide to secure cloud computing. Wiley Publishing; 2010 Aug 9.
  • Engelmann C, Vallee GR, Naughton T, Scott SL. Proactive fault tolerance using preemptive migration. 2009 IEEE 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing; 2009 Feb 18. p. 252–7.
  • Mahajan K, Dahiya D. A cloud based deployment framework for load balancing policies. 2014 IEEE 7th International Conference on Contemporary Computing (IC3); 2014 Aug 7. p. 565–70.
  • Wu Z, Ni Z, Gu L, Liu X. A revised discrete particle swarm optimization for cloud workflow scheduling. 2010 International Conference on Computational Intelligence and Security (CIS); 2010 Dec 11. p. 184–8.
  • Guermouche A, Ropars T, Brunet E, Snir M, Cappello F. Uncoordinated checkpointing without domino effect for send-deterministic mpi applications. 2011 IEEE International Parallel and Distributed Processing Symposium (IPDPS); 2011 May 16. p. 989–1000.
  • Beloglazov A, Buyya R. Energy efficient resource management in virtualized cloud data centers. Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing; 2010 May 17. p. 826–31. IEEE Computer Society.
  • Jhawar R, Piuri V, Santambrogio M. A comprehensive conceptual system-level approach to fault tolerance in cloud computing. 2012 IEEE International Systems Conference (SysCon); 2012 Mar 19. p. 1–5.
  • Nicolae B, Cappello F. BlobCR: Efficient checkpoint-restart for HPC applications on IaaS clouds using virtual disk image snapshots. Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis; 2011 Nov 12. p. 34.
  • Pal AS, Pattnaik BPK. Classification of virtualization environment for cloud computing. Indian Journal of Science and Technology. 2013 Jan; 6(1). DOI: 10.17485/ijst/2013/ v6i1/30572.
  • Rajathi A, Saravanan N. A survey on secure storage in cloud computing. Indian Journal of Science and Technology. 2013 Apr; 6(4). DOI: 10.17485/ijst/2013/v6i4/31871.
  • Nagaraju S, Parthiban L. SecAuthn: Provably secure multi-factor authentication for the cloud computing systems. Indian Journal of Science and Technology. 2016 Mar; 9(9). DOI: 10.17485/ijst/2016/v9i9/81070.
  • Kumari PS, Kamal ARNB. Optimal integrity policy for encrypted data in secure storage using cloud computing. Indian Journal of Science and Technology. 2016 Mar; 9(11). DOI: 10.17485/ijst/2016/v9i11/88453.


  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.