Conference: IEEE International Conference on Advanced Information Networking and Applications
Impact of MapReduce Task Re-execution Policy on Job Completion Reliability and Job Completion Time

Abstract

MapReduce is a widely accepted framework for data-intensive applications. Node failures occur frequently in large-scale MapReduce clusters. To keep them from interrupting jobs, current MapReduce implementations such as Hadoop employ a task re-execution policy (TR policy for short): when a map or reduce task of a job fails because of a node failure, the policy re-executes the task on another node. However, the impact of the TR policy on job completion reliability and job completion time has not been studied from a theoretical viewpoint, especially for jobs with different characteristics, e.g., different input data sizes, numbers of reduce tasks, and intermediate data sizes. In this study, we derive the job completion reliability (JCR for short) of a MapReduce job based on Poisson distributions and analyze the expected job completion time (JCT for short) based on the universal generating function. We use nine settings of the task re-execution factor (TR factor for short) to explore the impact of the TR policy on the JCR and JCT of jobs. The results show that the TR policy can effectively improve JCR without significantly prolonging JCT. However, no single TR factor lets all jobs achieve a high JCR.
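
To make the idea of a Poisson-based reliability estimate concrete, here is a minimal sketch. It is not the paper's derivation, which the abstract does not spell out. It assumes that node failures arrive as a Poisson process with a constant rate, that every attempt of a task is independent, and that a job completes only if every task completes. The names `failure_rate`, `task_durations`, and `max_attempts` are hypothetical; `max_attempts` is only a stand-in for the TR factor, whose exact definition the abstract does not give.

```python
import math


def task_success_probability(failure_rate, task_duration, max_attempts):
    """Probability that a task finishes within max_attempts attempts.

    Assumption (not from the paper): failures follow a Poisson process,
    so one attempt of length task_duration sees no failure with
    probability exp(-failure_rate * task_duration).
    """
    p_attempt_ok = math.exp(-failure_rate * task_duration)
    # The task fails only if every one of its independent attempts fails.
    return 1.0 - (1.0 - p_attempt_ok) ** max_attempts


def job_completion_reliability(failure_rate, task_durations, max_attempts):
    """Probability that all tasks of a job finish (tasks assumed independent)."""
    reliability = 1.0
    for duration in task_durations:
        reliability *= task_success_probability(
            failure_rate, duration, max_attempts
        )
    return reliability


if __name__ == "__main__":
    # Hypothetical job: four tasks with durations in seconds and one
    # failure per 1000 seconds on average. These numbers are illustrative.
    durations = [120.0, 120.0, 300.0, 600.0]
    for attempts in (1, 2, 4):
        jcr = job_completion_reliability(0.001, durations, attempts)
        print(f"max_attempts={attempts}: estimated JCR={jcr:.4f}")
```

Under these assumptions, allowing more attempts can only raise the estimate, and longer tasks need more attempts to reach the same reliability, which is consistent with the abstract's remark that no single TR factor suits all jobs. The sketch does not model job completion time.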