Journal: Knowledge and Information Systems

Answering ad hoc aggregate queries from data streams using prefix aggregate trees


Abstract

In some business applications, such as trading management in financial institutions, ad hoc aggregate queries over data streams must be answered accurately. Materializing and incrementally maintaining a full data cube over a data stream, or even a compressed or approximate version of one, is often computationally prohibitive. Previous studies proposed approximate methods for continuous aggregate queries, but these cannot provide accurate answers. In this paper, we develop a novel prefix aggregate tree (PAT) structure for warehousing data streams online and answering ad hoc aggregate queries. Often, a data stream can be partitioned into a historical segment, which is stored in a traditional data warehouse, and a transient segment, which can be stored in a PAT to answer ad hoc aggregate queries. The size of a PAT is linear in the size of the transient segment, and only one scan of the data stream is needed to create and incrementally maintain a PAT. Although answering a query with a PAT costs more than answering it from a fully materialized data cube, the query answering time remains linear in the size of the transient segment. Our extensive experimental results on both synthetic and real data sets illustrate the efficiency and scalability of our design.
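
The abstract does not spell out the internals of a PAT, so the following is only a minimal sketch of the general idea, under stated assumptions: each stream tuple is inserted along one root-to-leaf path keyed by its dimension values in a fixed order, and every node keeps the running SUM and COUNT of the tuples that share its prefix. The class name `PrefixAggregateTree`, the `Node` fields, the use of `None` as a wildcard, and the sample trading tuples are all hypothetical and are not taken from the paper. Under these assumptions each tuple adds at most one node per dimension (size linear in the number of tuples for a fixed number of dimensions), the tree is built in a single pass, and a query visits each node at most once.

```python
from dataclasses import dataclass, field
from typing import Dict, Hashable, Optional, Sequence, Tuple


@dataclass
class Node:
    """One prefix of dimension values, with aggregates of all tuples below it."""
    total: float = 0.0
    count: int = 0
    children: Dict[Hashable, "Node"] = field(default_factory=dict)


class PrefixAggregateTree:
    """Hypothetical simplified prefix tree with per-node SUM/COUNT (an assumption,
    not the paper's actual PAT definition)."""

    def __init__(self, num_dims: int) -> None:
        self.num_dims = num_dims
        self.root = Node()

    def insert(self, dims: Sequence[Hashable], measure: float) -> None:
        """Incrementally add one stream tuple; touches num_dims + 1 nodes."""
        if len(dims) != self.num_dims:
            raise ValueError("tuple has the wrong number of dimensions")
        node = self.root
        node.total += measure
        node.count += 1
        for value in dims:
            node = node.children.setdefault(value, Node())
            node.total += measure
            node.count += 1

    def query(self, pattern: Sequence[Optional[Hashable]]) -> Tuple[float, int]:
        """Return (SUM, COUNT) over tuples matching pattern.

        None means "any value" for that dimension, so None itself cannot be
        used as a dimension value in this sketch.
        """
        if len(pattern) != self.num_dims:
            raise ValueError("pattern has the wrong number of dimensions")
        # Index of the last constrained dimension; below it, the stored
        # prefix aggregate already answers the query.
        last = max((i for i, v in enumerate(pattern) if v is not None), default=-1)
        return self._query(self.root, pattern, 0, last)

    def _query(self, node: Node, pattern: Sequence[Optional[Hashable]],
               depth: int, last: int) -> Tuple[float, int]:
        if depth > last:
            return node.total, node.count
        wanted = pattern[depth]
        if wanted is not None:
            child = node.children.get(wanted)
            return self._query(child, pattern, depth + 1, last) if child else (0.0, 0)
        total, count = 0.0, 0
        for child in node.children.values():  # wildcard: combine all branches
            t, c = self._query(child, pattern, depth + 1, last)
            total += t
            count += c
        return total, count


# Illustrative transient segment: (exchange, symbol, side) -> traded amount.
pat = PrefixAggregateTree(num_dims=3)
for dims, amount in [
    (("NYSE", "IBM", "buy"), 100.0),
    (("NYSE", "IBM", "sell"), 40.0),
    (("NASDAQ", "MSFT", "buy"), 70.0),
    (("NYSE", "GE", "buy"), 10.0),
]:
    pat.insert(dims, amount)

print(pat.query(("NYSE", None, None)))   # (150.0, 3)
print(pat.query((None, None, "buy")))    # (180.0, 3)
```

In this sketch a query that constrains only leading dimensions is answered from a single stored prefix aggregate, while a query that constrains a later dimension must walk the branches above it. That is one plausible reading of why query answering costs more than a lookup in a fully materialized cube yet stays linear in the size of the stored segment.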
