首页> 美国卫生研究院文献>Elsevier Public Health Emergency Collection >Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism
【2h】

Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism

机译:具有多个最小支持的挖掘关联规则:新的挖掘算法和支持调整机制

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Mining association rules with multiple minimum supports is an important generalization of the association-rule-mining problem, which was recently proposed by Liu et al. Instead of setting a single minimum support threshold for all items, they allow users to specify multiple minimum supports to reflect the natures of the items, and an Apriori-based algorithm, named MSapriori, is developed to mine all frequent itemsets. In this paper, we study the same problem but with two additional improvements. First, we propose a FP-tree-like structure, MIS-tree, to store the crucial information about frequent patterns. Accordingly, an efficient MIS-tree-based algorithm, called the CFP-growth algorithm, is developed for mining all frequent itemsets. Second, since each item can have its own minimum support, it is very difficult for users to set the appropriate thresholds for all items at a time. In practice, users need to tune items' supports and run the mining algorithm repeatedly until a satisfactory end is reached. To speed up this time-consuming tuning process, an efficient algorithm which can maintain the MIS-tree structure without rescanning database is proposed. Experiments on both synthetic and real-life datasets show that our algorithms are much more efficient and scalable than the previous algorithm.
机译:挖掘具有多个最小支持的关联规则是对关联规则挖掘问题的重要概括,这是Liu等人最近提出的。他们没有为所有项目设置单个最小支持阈值,而是允许用户指定多个最小支持以反映项目的性质,并且开发了一种名为MSapriori的基于Apriori的算法来挖掘所有频繁的项目集。在本文中,我们研究了相同的问题,但还有两个其他改进。首先,我们提出了一种类似于FP树的结构,即MIS树,用于存储有关频繁模式的关键信息。因此,开发了一种有效的基于MIS树的算法,称为CFP增长算法,用于挖掘所有频繁项集。其次,由于每个项目都可以拥有自己的最小支持,因此用户很难一次为所有项目设置适当的阈值。实际上,用户需要调整项目的支持并重复运行挖掘算法,直到达到满意的效果为止。为了加快此耗时的调整过程,提出了一种无需重新扫描数据库即可维持MIS树结构的高效算法。在合成数据集和真实数据集上的实验表明,我们的算法比以前的算法更加有效和可扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号