首页> 外文期刊>Journal of Southeast University >Data partitioning based on sampling for power load streams
【24h】

Data partitioning based on sampling for power load streams

机译:基于采样的电力负载流数据分区

获取原文
获取原文并翻译 | 示例
           

摘要

A novel data streams partitioning method is proposed to resolve problems of range-aggregation continuous queries over parallel streams for power industry. The first step of this method is to parallel sample the data, which is implemented as an extended reservoir-sampling algorithm. A skip factor based on the change ratio of data-values is introduced to describe the distribution characteristics of data-values adaptively. The second step of this method is to partition the fluxes of data streams averagely, which is implemented with two alternative equal-depth histogram generating algorithms that fit the different cases: one for incremental maintenance based on heuristics and the other for periodical updates to generate an approximate partition vector. The experimental results on actual data prove that the method is efficient, practical and suitable for time-varying data streams processing.
机译:提出了一种新颖的数据流分区方法,以解决电力行业并行流范围聚集连续查询的问题。该方法的第一步是对数据进行并行采样,将其实现为扩展的储层采样算法。引入基于数据值变化率的跳跃因子来自适应地描述数据值的分布特征。此方法的第二步是平均分配数据流的通量,这是通过两种适合不同情况的替代等深度直方图生成算法实现的:一种用于基于启发式的增量维护,另一种用于定期更新以生成数据。近似分区向量。实际数据的实验结果证明,该方法有效,实用,适用于时变数据流的处理。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号