首页> 外文期刊>Wuhan University Journal of Natural Sciences >A Granularity-Aware Parallel Aggregation Method for Data Streams
【24h】

A Granularity-Aware Parallel Aggregation Method for Data Streams

机译:数据流的粒度感知并行聚合方法

获取原文
获取原文并翻译 | 示例
           

摘要

This paper focuses on the parallel aggregation processing of data streams based on the shared-nothing architecture. A novel granularity-aware parallel aggregating model is proposed. It employs parallel sampling and linear regression to describe the characteristics of the data quantity in the query window in order to determine the partition granularity of tuples, and utilizes equal depth histogram to implement partitioning. This method can avoid data skew and reduce communication cost. The experimentresults on both synthetic data and actual data prove that the proposed method is efficient, practical and suitable for time varying data streams processing.
机译:本文重点讨论基于无共享架构的数据流的并行聚合处理。提出了一种新的粒度感知并行聚合模型。它使用并行采样和线性回归来描述查询窗口中数据量的特征,以确定元组的分区粒度,并利用相等的深度直方图来实现分区。这种方法可以避免数据偏斜并降低通信成本。综合数据和实际数据的实验结果表明,该方法高效,实用,适用于时变数据流的处理。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号