作者: Qi Liu , Weidong Cai , Jian Shen , Zhangjie Fu , Xiaodong Liu
DOI: 10.23919/ICACT.2017.7890240
关键词:
摘要: MapReduce (MR) has been widely used to process distributed large data sets. MRV2 working on Yarn, as a more advanced programing model, gained lots of concerns. Meanwhile, speculative execution is known an approach for dealing with same problems by backing up those tasks running low performance machine higher one. In this paper, we have modified some pitfalls and taken heterogeneous environment into consideration. Besides, Node classification novel hierarchy index mechanism created. We also implemented it in Hadoop-2.6 the strategy above called Speculation-NC while optimized Hadoop Hadoop-NC. Experiment results show that our method can correctly backup task, improve decrease time resource consumption compared traditional strategies.