...
首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Maintaining sliding window skylines on data streams
【24h】

Maintaining sliding window skylines on data streams

机译:维持数据流上滑动的窗口天际线

获取原文
获取原文并翻译 | 示例
           

摘要

The skyline of a multidimensional data set contains the "best" tuples according to any preference function that is monotonic on each dimension. Although skyline computation has received considerable attention in conventional databases, the existing algorithms are inapplicable to stream applications because 1) they assume static data that are stored in the disk (rather than continuously arriving/expiring), 2) they focus on "one-time" execution that returns a single skyline (in contrast to constantly tracking skyline changes), and 3) they aim at reducing the I/O overhead (as opposed to minimizing the CPU-cost and main-memory consumption). This paper studies skyline computation in stream environments, where query processing takes into account only a "sliding window" covering the most recent tuples. We propose algorithms that continuously monitor the incoming data and maintain the skyline incrementally. Our techniques utilize several interesting properties of stream skylines to improve space/time efficiency by expunging data from the system as early as possible (i.e., before their expiration). Furthermore, we analyze the asymptotical performance of the proposed solutions, and evaluate their efficiency with extensive experiments.
机译:多维数据集的天际线根据在每个维度上单调的任何首选项函数包含“最佳”元组。尽管天际线计算已在常规数据库中引起了广泛关注,但现有算法不适用于流应用程序,因为1)它们假定存储在磁盘中的静态数据(而不是连续到达/到期),2)他们专注于“一次性返回单一天际线的执行(与不断跟踪天际线的变化形成对比),以及3)它们旨在减少I / O开销(而不是最小化CPU成本和主内存消耗)。本文研究流环境中的天际线计算,其中查询处理仅考虑覆盖最新元组的“滑动窗口”。我们提出了可连续监视传入数据并逐步维护天际线的算法。我们的技术利用河流天际线的几个有趣属性,通过尽早(即在其过期之前)清除系统中的数据来提高空间/时间效率。此外,我们分析了所提出解决方案的渐近性能,并通过大量实验评估了它们的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号