首页> 外文会议>International conference on data science engineering >Fault tolerant state management for high-volume low-latency data stream workloads
【24h】

Fault tolerant state management for high-volume low-latency data stream workloads

机译:大容量低延迟数据流工作负载的容错状态管理

获取原文

摘要

One of the major challenges in performing incremental computations on parallel distributed stream processing systems is in the implementation of a mechanism for passing state values across successive runs. One approach is to enhance the granularity from record-at-a-time processing to processing at micro-batch level. A contrasting approach is to follow the record-at-a-time semantics and ensure scalability by means of distributed state management. Both approaches, however, require observing high degree of fault tolerance. In this paper, we study the problem of process state management against non-terminating data stream workloads for low-latency computing using the micro-batch stream processing approach. We attempt to examine methods that could yield optimum levels of state retentions with high degree of fault tolerance for typical processing workloads and propose a three-pronged approach to harness the demand.
机译:在并行分布式流处理系统上执行增量计算的主要挑战之一是在实现连续运行中传递状态值的机制。一种方法是将粒度从记录在一次处理中提升到微批级处理。对比度方法是遵循记录AT-AT-A-TIME语义,并通过分布式状态管理确保可扩展性。然而,这两种方法都需要观察到高度的容错程度。在本文中,我们使用微批流处理方法对低延迟计算的非终止数据流工作负载的过程状态管理问题。我们试图检查能够为典型加工工作负载提高高度容错的最佳状态保留的方法,并提出一种三管齐下的方法来利用需求。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号