首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >AIDE: An Active Learning-Based Approach for Interactive Data Exploration
【24h】

AIDE: An Active Learning-Based Approach for Interactive Data Exploration

机译:AIDE:一种基于主动学习的交互式数据探索方法

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we argue that database systems be augmented with an automated data exploration service that methodically steers users through the data in a meaningful way. Such an automated system is crucial for deriving insights from complex datasets found in many big data applications such as scientific and healthcare applications as well as for reducing the human effort of data exploration. Towards this end, we present AIDE, an Automatic Interactive Data Exploration framework that assists users in discovering new interesting data patterns and eliminate expensive ad-hoc exploratory queries. AIDE relies on a seamless integration of classification algorithms and data management optimization techniques that collectively strive to accurately learn the user interests based on his relevance feedback on strategically collected samples. We present a number of exploration techniques as well as optimizations that minimize the number of samples presented to the user while offering interactive performance. AIDE can deliver highly accurate query predictions for very common conjunctive queries with small user effort while, given a reasonable number of samples, it can predict with high accuracy complex disjunctive queries. It provides interactive performance as it limits the user wait time per iteration of exploration to less than a few seconds.
机译:在本文中,我们认为数据库系统将通过自动的数据探索服务得到扩展,该服务以有意义的方式有条不紊地引导用户浏览数据。这样的自动化系统对于从许多大数据应用程序(例如科学和医疗保健应用程序)中发现的复杂数据集获取洞察力以及减少人工数据探索工作至关重要。为此,我们提出了AIDE,这是一个自动交互式数据探索框架,可帮助用户发现新的有趣数据模式并消除昂贵的临时探索性查询。 AIDE依赖于分类算法和数据管理优化技术的无缝集成,这些技术基于对策略性收集的样本的相关性反馈,共同努力准确地学习用户的兴趣。我们提供了许多探索技术和优化方法,这些方法和优化方法在提供交互式性能的同时,最大限度地减少了呈现给用户的样本数量。 AIDE可以用很少的工作量为非常常见的联合查询提供高度准确的查询预测,而在给定合理数量的样本的情况下,AIDE可以以高精度进行复杂的联合查询的预测。它提供交互式性能,因为它将每次探索迭代的用户等待时间限制在几秒钟以内。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号