...
首页> 外文期刊>IEICE Transactions on Information and Systems >An Efficiency-Aware Scheduling for Data-Intensive Computations on MapReduce Clusters
【24h】

An Efficiency-Aware Scheduling for Data-Intensive Computations on MapReduce Clusters

机译:MapReduce集群上的数据密集型计算的效率感知调度

获取原文
获取原文并翻译 | 示例
           

摘要

Scheduling plays a key role in MapReduce systems. In this paper, we explore the efficiency of an MapReduce cluster running lots of independent and continuously arriving MapReduce jobs. Data locality and load balancing are two important factors to improve computation efficiency in MapReduce systems for data-intensive computations. Traditional cluster scheduling technologies are not well suitable for MapReduce environment, there are some in-used schedulers for the popular open-source Hadoop MapReduce implementation, however, they can not well optimize both factors. Our main objective is to minimize total flowtime of all jobs, given it's a strong NP-hard problem, we adopt some effective heuristics to seek satisfied solution. In this paper, we formalize the scheduling problem as job selection problem, a load balance aware job selection algorithm is proposed, in task level we design a strict data locality tasks scheduling algorithm for map tasks on map machines and a load balance aware scheduling algorithm for reduce tasks on reduce machines. Comprehensive experiments have been conducted to compare our scheduling strategy with well-known Hadoop scheduling strategies. The experimental results validate the efficiency of our proposed scheduling strategy.
机译:调度在MapReduce系统中起着关键作用。在本文中,我们探索了运行大量独立且连续到达的MapReduce作业的MapReduce集群的效率。数据局部性和负载平衡是提高MapReduce系统中数据密集型计算效率的两个重要因素。传统的集群调度技术不太适合MapReduce环境,流行的开源Hadoop MapReduce实现有一些正在使用的调度程序,但是它们不能很好地优化这两个因素。我们的主要目标是最大程度地减少所有作业的总流转时间,考虑到这是一个NP难题,我们采用一些有效的启发式方法来寻求满意的解决方案。在本文中,我们将调度问题形式化为作业选择问题,提出了一种负载平衡感知作业选择算法,在任务级别上,针对地图机上的地图任务设计了严格的数据局部性任务调度算法,以及针对地图机器的负载平衡感知调度算法在减少机器上减少任务。已经进行了全面的实验,以将我们的调度策略与著名的Hadoop调度策略进行比较。实验结果验证了我们提出的调度策略的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号