首页> 外文期刊>Journal of Computer Science & Technology >PGG: An Online Pattern Based Approach for Stream Variation Management
【24h】

PGG: An Online Pattern Based Approach for Stream Variation Management

机译:PGG:一种基于在线模式的流变化管理方法

获取原文
获取原文并翻译 | 示例
           

摘要

Many database applications require efficient processing of data streams with value variations and fluctuant sampling frequency. The variations typically imply fundamental features of the stream and important domain knowledge of underlying objects. In some data streams, successive events seem to recur in a certain time interval, but the data indeed evolves with tiny differences as time elapses. This feature, so called pseudo periodicity, poses a new challenge to stream variation management. This study focuses on the online management for variations over such streams. The idea can be applied to many scenarios such as patient vital signal monitoring in medical applications. This paper proposes a new method named Pattern Growth Graph (PGG) to detect and manage variations over evolving streams with following features: 1) adopts the wave-pattern to capture the major information of data evolution and represent them compactly; 2) detects the variations in a single pass over the stream with the help of wave-pattern matching algorithm; 3) only stores different segments of the pattern for incoming stream, and hence substantially compresses the data without losing important information; 4) distinguishes meaningful data changes from noise and reconstructs the stream with acceptable accuracy. Extensive experiments on real datasets containing millions of data items, as well as a prototype system, are carried out to demonstrate the feasibility and effectiveness of the proposed scheme.
机译:许多数据库应用程序需要有效处理具有值变化和波动的采样频率的数据流。这些变化通常暗示流的基本特征和基础对象的重要领域知识。在某些数据流中,连续的事件似乎在特定的时间间隔内重复发生,但是随着时间的流逝,数据的发展确实存在微小的差异。这个功能,所谓的伪周期性,对流变化管理提出了新的挑战。这项研究的重点是在线管理此类流的变化。该想法可以应用于许多场景,例如医疗应用中的患者生命信号监测。本文提出了一种新的模式增长图(PGG)方法,用于检测和管理演化流的变化,具有以下特点:1)采用波型图捕获数据演化的主要信息,并紧凑地表示它们。 2)借助波形模式匹配算法检测流中单次通过的变化; 3)只为输入流存储模式的不同段,因此实质上压缩了数据而不会丢失重要信息; 4)区分有意义的数据变化和噪声,并以可接受的精度重建流。对包含数百万个数据项的真实数据集以及原型系统进行了广泛的实验,以证明该方案的可行性和有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号