首页> 外文期刊>Journal of Computer Science & Technology >Efficient Incremental Maintenance for Distributive and Non-Distributive Aggregate Functions
【24h】

Efficient Incremental Maintenance for Distributive and Non-Distributive Aggregate Functions

机译:分布式和非分布式聚合功能的有效增量维护

获取原文
获取原文并翻译 | 示例
           

摘要

Data cube pre-computation is an important concept for supporting OLAP (Online Analytical Processing) and has been studied extensively. It is often not feasible to compute a complete data cube due to the huge storage requirement. Recently proposed quotient cube addressed this issue through a partitioning method that groups cube cells into equivalence partitions. Such an approach not only is useful for distributive aggregate functions such as SUM but also can be applied to the maintenance of holistic aggregate functions like MEDIAN which will require the storage of a set of tuples for each equivalence class. Unfortunately, as changes are made to the data sources, maintaining the quotient cube is non-trivial since the partitioning of the cube cells must also be updated. In this paper, the authors design incremental algorithms to update a quotient cube efficiently for both SUM and MEDIAN aggregate functions. For the aggregate function SUM, concepts are borrowed from the principle of Galois Lattice to develop CPU-efficient algorithms to update a quotient cube. For the aggregate function MEDIAN, the concept of a pseudo class is introduced to further reduce the size of the quotient cube. Coupled with a novel sliding window technique, an efficient algorithm is developed for maintaining a MEDIAN quotient cube that takes up reasonably small storage space. Performance study shows that the proposed algorithms are efficient and scalable over large databases.
机译:数据多维数据集预计算是支持OLAP(在线分析处理)的重要概念,并且已经进行了广泛的研究。由于巨大的存储需求,计算一个完整的数据立方体通常是不可行的。最近提出的商立方通过一种将立方单元格分组为等效分区的分区方法解决了此问题。这种方法不仅对诸如SUM之类的分布式聚合函数有用,而且可以应用于维护诸如MEDIAN的整体聚合函数,这将需要为每个等价类存储一组元组。不幸的是,由于对数据源进行了更改,因此商立方的维护并不容易,因为必须同时更新立方单元的分区。在本文中,作者设计了增量算法来针对SUM和MEDIAN聚合函数有效地更新商立方。对于聚合函数SUM,从Galois Lattice的原理中借鉴了一些概念,以开发CPU效率高的算法来更新商立方。对于聚合函数MEDIAN,引入了伪类的概念以进一步减小商立方的大小。结合新颖的滑动窗口技术,开发了一种有效的算法,用于维护占用相当小的存储空间的MEDIAN商立方。性能研究表明,所提出的算法在大型数据库上高效且可扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号