首页> 外文学位 >Toward practical multi-workflow scheduling in cluster and grid environments.
【24h】

Toward practical multi-workflow scheduling in cluster and grid environments.

机译:在集群和网格环境中实现实用的多工作流计划。

获取原文
获取原文并翻译 | 示例

摘要

Workflow applications are gaining popularity in recent years because of the prevalence of cluster and Grid environments. Many algorithms have been developed ever since, however two fundamental challenges in this area, i.e., dynamic resource and dynamic workload, are not well addressed. In cluster and Grid environments, resources may be contributed and controlled by different virtual organizations and shared by a variety of users who in turn submit various kinds of applications. Resources are heterogeneous under different ownership, their availability varies over time and may fail in a high rate. On the other hand, resources are shared and hence competed among many applications with various computation requirements. Existing static algorithms are designed to schedule a single workflow application, without considering other workloads and any resource competition in the system. Hence static approaches are not utilized widely in practice despite its known advantages. Dynamic scheduling approaches can handle the dynamic workload and resources practically by nature but their effectiveness has yet to optimize as they do not have a global view of workflow application and scheduling decision is made nearsighted locally.;In this dissertation, as an effort toward practically scheduling workflow applications in cluster and Grid environments, a failure aware dynamic scheduling strategy for multiple workflow applications is proposed. The approach makes scheduling decision only when a task is ready, as traditional dynamic approach does, but leverages task dependency information, execution time estimation, failure prediction and queue wait time prediction. With preassigned priority for each task by the workflow Planner, the workflow Executor globally prioritizes all the ready to execute tasks in queue and schedules the individual task to the most suitable resource collection in order to minimize the overall workflow execution time. Furthermore, the algorithm is extended to a cluster of clusters environment, where each cluster has its own local workload management system. As a conclusion, the findings of the research is four folded: (1) With adaptability to dynamic resource change, the proposed strategy not only outperforms the purely dynamic ones but also improves over the traditional static ones. And it performs more efficiently with data intensive application of higher degree of parallelism. (2) When guided by the Planner, the proposed strategy can schedule multiple workflows dynamically without requiring merging the workflows a priori. It significantly outperforms two other traditional dynamic algorithms by 43.6% and 36.7% with respect to workflow makespan and turnaround time respectively, and it performs even better when the number of concurrent workflow applications increases and the resources are scarce. (3) We observer that the traditional failure prediction accuracy definitions impose different performance implications on different applications and fail to measure how that improves scheduling effectiveness, and propose two definitions on failure prediction accuracy from the perspectives of system and scheduling respectively. The comprehensive evaluation results using real failure traces show that the proposed strategy performs well with practically achievable prediction accuracy by reducing the average makespan, the loss time and the number of job rescheduling. (4) The proposed algorithm can be augmented to Grids in form multicluster where each cluster has its own workload management system. The proposed queue wait time aware algorithm leverages the advancement of queue wait time prediction techniques and empirically studies if the tunability of resource requirements helps scheduling. The extensive experiment with both real workload traces and test bench shows that the queue wait time aware algorithm improves workflow performance by 3 to 10 times in terms of average makespan with relatively very low cost of data movement.;Finally, the research studies how to benefit from existing researches and practices on both static and dynamic scheduling, introduces a hybrid scheduling scheme, i.e., a planner guided dynamic scheduling approach, targets on dynamic workload on cluster and Grid environment. A prototype is developed based on Condor platform to prove the concept of proposed algorithm.
机译:近年来,由于集群和网格环境的盛行,工作流应用程序变得越来越流行。自那时以来已经开发了许多算法,但是,该领域中的两个基本挑战,即动态资源和动态工作负载,没有得到很好的解决。在群集和网格环境中,资源可能由不同的虚拟组织提供和控制,并由各种用户共享,这些用户又提交了各种应用程序。在不同的所有权下资源是异构的,它们的可用性随时间而变化,并且可能以很高的速度失败。另一方面,资源是共享的,因此在具有各种计算要求的许多应用程序之间竞争。现有的静态算法旨在安排单个工作流程应用程序,而无需考虑其他工作负载和系统中的任何资源竞争。因此,尽管静态方法具有已知的优点,但在实践中并未得到广泛使用。动态调度方法实际上可以自然地处理动态工作负载和资源,但是它们的效率尚未得到优化,因为它们没有全局的工作流应用程序视图,并且调度决策是在本地进行近视的。在集群和Grid环境中的工作流应用程序中,提出了针对多个工作流应用程序的故障感知动态调度策略。与传统的动态方法一样,该方法仅在任务准备就绪时才进行调度决策,但是会利用任务依赖性信息,执行时间估计,故障预测和队列等待时间预测。通过工作流规划器为每个任务预先分配的优先级,工作流执行程序全局地对所有准备就绪的待执行任务进行优先级排序,并将单个任务安排到最合适的资源集合中,以最大程度地缩短总体工作流执行时间。此外,该算法被扩展到集群环境中,其中每个集群都有自己的本地工作负载管理系统。综上所述,本研究的结果有四个方面:(1)该策略在适应动态资源变化的同时,不仅优于纯动态策略,而且对传统静态策略进行了改进。而且,通过高度并行的数据密集型应用程序,它可以更高效地执行。 (2)在计划者的指导下,提出的策略可以动态地调度多个工作流,而无需先验地合并工作流。在工作流的完成时间和周转时间方面,它的性能分别比其他两个传统动态算法分别高出43.6%和36.7%,并且当并发工作流应用程序的数量增加且资源匮乏时,它的性能甚至更好。 (3)我们观察到,传统的故障预测精度定义对不同的应用施加不同的性能影响,并且无法衡量如何提高调度效率,并且分别从系统和调度的角度提出了两种故障预测精度的定义。使用实际故障痕迹的综合评估结果表明,所提出的策略通过减少平均有效期,减少损失时间和重新安排工作的次数,可以很好地实现预测精度。 (4)所提出的算法可以扩展为多集群形式的网格,其中每个集群都有自己的工作负载管理系统。提出的队列等待时间感知算法利用了队列等待时间预测技术的发展,并通过经验研究了资源需求的可调性是否有助于调度。结合实际工作负载跟踪和测试台架进行的广泛实验表明,队列等待时间感知算法将工作流性能平均提高了3到10倍,而数据移动成本却相对较低。从有关静态和动态调度的现有研究和实践中,介绍了一种混合调度方案,即计划者指导的动态调度方法,其目标是集群和Grid环境中的动态工作负载。基于Condor平台开发了原型,以证明所提出算法的概念。

著录项

  • 作者

    Yu, Zhifeng.;

  • 作者单位

    Wayne State University.;

  • 授予单位 Wayne State University.;
  • 学科 Operations Research.;Computer Science.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 149 p.
  • 总页数 149
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 运筹学;自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号