【24h】

Task allocation for distributed stream processing

机译:分布式流处理的任务分配

获取原文
获取原文并翻译 | 示例

摘要

There is a growing demand for live, on-the-fly processing of increasingly large amounts of data. In order to ensure the timely and reliable processing of streaming data, a variety of distributed stream processing architectures and platforms have been developed, which handle the fundamental tasks of (dynamically) assigning processing tasks to the currently available physical resources and routing streaming data between these resources. However, while there are plenty of platforms offering such functionality, the theory behind it is not well understood. In particular, it is unclear how to best allocate the processing tasks to the given resources. In this paper, we establish a theoretical foundation by formally defining a task allocation problem for distributed stream processing, which we prove to be NP-hard. Furthermore, we propose an approximation algorithm for the class of series-parallel decomposable graphs, which captures a broad range of common stream processing applications. The algorithm achieves a constant-factor approximation under the assumptions that the number of resources scales at least logarithmically with the number of computational tasks and the computational cost of the tasks dominates the cost of communication.
机译:对实时,实时处理越来越多的数据的需求不断增长。为了确保及时,可靠地处理流数据,已经开发了各种分布式流处理体系结构和平台,它们处理(动态)将处理任务分配给当前可用的物理资源并在它们之间路由流数据的基本任务。资源。但是,尽管有很多平台提供了这种功能,但其背后的理论尚未得到很好的理解。特别是,不清楚如何最佳地将处理任务分配给给定资源。在本文中,我们通过正式定义分布式流处理的任务分配问题来建立理论基础,我们证明这是NP难的。此外,我们针对一类串并联可分解图提出了一种近似算法,该算法捕获了广泛的常见流处理应用程序。在资源数量至少与计算任务数量成对数比例缩放且任务的计算成本主导通信成本的假设下,该算法可实现恒定因子近似。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号