首页> 外文会议>IEEE International Conference on Cloud Computing Technology and Science >Gemini: An Adaptive Performance-Fairness Scheduler for Data-Intensive Cluster Computing
【24h】

Gemini: An Adaptive Performance-Fairness Scheduler for Data-Intensive Cluster Computing

机译:Gemini:适用于数据密集型集群计算的自适应性能公平调度器

获取原文

摘要

In data-intensive cluster computing platforms such as Hadoop YARN, performance and fairness are two important factors for system design and optimizations. Many previous studies are either for performance or for fairness solely, without considering the tradeoff between performance and fairness. Recent studies observe that there is a tradeoff between performance and fairness because of resource contention between users/jobs. However, their scheduling algorithms for bi-criteria optimization between performance and fairness are static, without considering the impact of different workload characteristics on the tradeoff between performance and fairness. In this paper, we propose an adaptive scheduler called Gemini for Hadoop YARN. We first develop a model with the regression approach to estimate the performance improvement and the fairness loss under the sharing computation compared to the exclusive non-sharing scenario. Next, we leverage the model to guide the resource allocation for pending tasks to optimize the performance of the cluster given the user-defined fairness level. Instead of using a static scheduling policy, Gemini adaptively decides the proper scheduling policy according to the current running workload. We implement Gemini in Hadoop YARN. Experimental results show that Gemini outperforms the state-of-the-art approach in two aspects. 1) For the same fairness loss, Gemini improves the performance by up to 225% and 200% in real deployment and the large-scale simulation, respectively, 2) For the same performance improvement, Gemini reduces the fairness loss up to 70% and 62.5% in real deployment and the large-scale simulation, respectively.
机译:在诸如Hadoop YARN之类的数据密集型集群计算平台中,性能和公平性是系统设计和优化的两个重要因素。先前的许多研究都是出于性能或公正性考虑,而没有考虑性能与公平性之间的权衡。最近的研究发现,由于用户/工作之间的资源争用,因此在性能和公平性之间存在折衷。但是,它们用于性能和公平性之间的双标准优化的调度算法是静态的,没有考虑不同工作负载特征对性能和公平性之间的折衷的影响。在本文中,我们为Hadoop YARN提出了一种称为Gemini的自适应调度程序。我们首先使用回归方法开发一个模型,以估计与专有非共享方案相比在共享计算下的性能改进和公平损失。接下来,在给定用户定义的公平性级别的情况下,我们利用模型来指导未完成任务的资源分配,以优化集群的性能。 Gemini不会使用静态调度策略,而是根据当前正在运行的工作负载自适应地确定适当的调度策略。我们在Hadoop YARN中实现Gemini。实验结果表明,双子星座在两个方面都优于最新方法。 1)对于相同的公平性损失,Gemini在实际部署和大规模仿真中分别将性能提高了225%和200%,2)对于相同的性能改善,Gemini减少了70%的公平性损失,以及实际部署和大规模仿真的比例分别为62.5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号