On the design of reactive approach with flexible checkpoint interval to tolerate faults in cloud computing systems

作者: Mohammed Amoon , Nirmeen El-Bahnasawy , Samy Sadi , Manar Wagdi

DOI: 10.1007/S12652-018-1139-Y

关键词:

摘要: The likelihood of failures rises in cloud computing systems as a result their unstable nature. Additionally, the size system varies with time and thus become common incident. Failures have high impact on performance expected benefits for both customers providers. Fault tolerance is an essential challenge facing providers order to mitigate effects maintaining Service Level Agreement (SLA) satisfied. Checkpointing one most known reactive fault techniques used distributed computing. However, it can incur considerable overheads that depend interval checkpoint applied these put down cloud. In this paper, approach context checkpointing proposed evaluated aim getting better performance. depends applying flexible reduce overheads. Simulation experiments indicate superior terms power consumption, response time, monetary cost capacity.

参考文章(28)
Simon Ostermann, Kassian Plankensteiner, Radu Prodan, Thomas Fahringer, GroudSim: an event-based simulation framework for computational grids and clouds european conference on parallel processing. pp. 305- 313 ,(2010) , 10.1007/978-3-642-21878-1_38
Sheng Di, Yves Robert, Frédéric Vivien, Derrick Kondo, Cho-Li Wang, Franck Cappello, Optimization of cloud task processing with checkpoint-restart mechanism ieee international conference on high performance computing data and analytics. pp. 64- ,(2013) , 10.1145/2503210.2503217
Rajkumar Buyya, Chee Shin Yeo, Srikumar Venugopal, James Broberg, Ivona Brandic, None, Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility Future Generation Computer Systems. ,vol. 25, pp. 599- 616 ,(2009) , 10.1016/J.FUTURE.2008.12.001
Nakharin Limrungsi, Juzi Zhao, Yu Xiang, Tian Lan, H. Howie Huang, Suresh Subramaniam, Providing reliability as an elastic service in cloud computing international conference on communications. pp. 2912- 2917 ,(2012) , 10.1109/ICC.2012.6364649
Ghalem Belalem, Said Limam, Fault Tolerant Architecture to Cloud Computing Using Adaptive Checkpoint International Journal of Cloud Applications and Computing archive. ,vol. 1, pp. 60- 69 ,(2011) , 10.4018/IJCAC.2011100105
Xiang Ni, Esteban Meneses, Laxmikant V. Kale, Hiding Checkpoint Overhead in HPC Applications with a Semi-Blocking Algorithm international conference on cluster computing. pp. 364- 372 ,(2012) , 10.1109/CLUSTER.2012.82
Inigo Goiri, Ferran Julia, Jordi Guitart, Jordi Torres, Checkpoint-based fault-tolerant infrastructure for virtualized service providers network operations and management symposium. pp. 455- 462 ,(2010) , 10.1109/NOMS.2010.5488493
Dong Liu, A fault-tolerant architecture for ROIA in cloud ambient intelligence. ,vol. 6, pp. 587- 595 ,(2015) , 10.1007/S12652-014-0220-4
Dzmitry Kliazovich, Pascal Bouvry, Samee Ullah Khan, GreenCloud: a packet-level simulator of energy-aware cloud computing data centers The Journal of Supercomputing. ,vol. 62, pp. 1263- 1283 ,(2012) , 10.1007/S11227-010-0504-1
Said Limam, Ghalem Belalem, A Migration Approach for Fault Tolerance in Cloud Computing ieee international conference on high performance computing data and analytics. ,vol. 6, pp. 24- 37 ,(2014) , 10.4018/IJGHPC.2014040102