首页> 外文期刊>Software >Generic windowing support for extensible stream processing systems
【24h】

Generic windowing support for extensible stream processing systems

机译:对可扩展流处理系统的通用窗口支持

获取原文
获取原文并翻译 | 示例
           

摘要

Stream processing applications process high volume, continuous feeds from live data sources, employ data-in-motion analytics to analyze these feeds, and produce near real-time insights with low latency. One of the fundamental characteristics of such applications is the on-the-fly nature of the computation, which does not require access to disk resident data. Stream processing applications store the most recent history of streams in memory and use it to perform the necessary modeling and analysis tasks. This recent history is often managed using windows. All data stream management systems provide some form of windowing functionality. Windowing makes it possible to implement streaming versions of the traditionally blocking relational operators, such as streaming aggregations, joins, and sorts, as well as any other analytic operator that requires keeping the most recent tuples as state, such as time series analysis operators and signal processing operators. In this paper, we provide a categorization of different window types and policies employed in stream processing applications and give detailed operational semantics for various window configurations. We describe an extensibility mechanism that makes it possible to integrate windowing support into user-defined operators, enabling consistent syntax and semantics across system-provided and third-party toolkits of streaming operators. We describe the design and implementation of a runtime windowing library that significantly simplifies the construction of window-based operators by decoupling the handling of window policies and operator logic from each other. We present our experience using the windowing library to implement a relational operators toolkit and compare the efficacy of the solution to an earlier implementation that did not employ a common windowing library.
机译:流处理应用程序处理来自实时数据源的大量,连续的提要,使用动态数据分析来分析这些提要,并以低延迟生成近乎实时的洞察力。这种应用程序的基本特征之一是计算的即时性,它不需要访问磁盘驻留数据。流处理应用程序将流的最新历史记录存储在内存中,并使用它来执行必要的建模和分析任务。最近的历史记录通常使用Windows进行管理。所有数据流管理系统都提供某种形式的窗口功能。窗口化使得可以实现传统阻塞关系运算符的流版本,例如流聚合,联接和排序,以及需要保留最新元组作为状态的任何其他分析运算符,例如时间序列分析运算符和信号加工经营者。在本文中,我们提供了流处理应用程序中使用的不同窗口类型和策略的分类,并给出了各种窗口配置的详细操作语义。我们描述了一种可扩展性机制,该机制使将窗口支持集成到用户定义的运算符中成为可能,从而使跨流运算符的系统提供的第三方工具包中的语法和语义保持一致。我们描述了运行时窗口库的设计和实现,该库通过将窗口策略和运算符逻辑的处理彼此分离,大大简化了基于窗口的运算符的构造。我们将介绍我们使用窗口库来实现关系运算符工具包的经验,并将该解决方案的功效与未采用公共窗口库的较早实现进行比较。

著录项

  • 来源
    《Software》 |2014年第9期|1105-1128|共24页
  • 作者

    Bugra Gedik;

  • 作者单位

    Computer Engineering Department, Bilkent University, Ankara 06800, Turkey;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    data stream processing; windowing semantics; windowing library;

    机译:数据流处理;窗口语义;窗口库;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号