...
首页> 外文期刊>Journal of software >An Algorithm for Mining Frequent Itemsets from Library Big Data
【24h】

An Algorithm for Mining Frequent Itemsets from Library Big Data

机译:一种从图书馆大数据中挖掘频繁项集的算法

获取原文
获取原文并翻译 | 示例
           

摘要

Frequent itemset mining plays an important part in college library data analysis. Because there are a lot of redundant data in library database, the mining process may generate intra-property frequent itemsets, and this hinders its efficiency significantly. To address this issue, we propose an improved FP-Growth algorithm we call RFP-Growth to avoid generating intra-property frequent itemsets, and to further boost its efficiency, implement its MapReduce version with additional prune strategy. The proposed algorithm was tested using both synthetic and real world library data, and the experimental results showed that the proposed algorithm outperformed existing algorithms.
机译:频繁项集挖掘在大学图书馆数据分析中起着重要的作用。由于库数据库中有大量冗余数据,因此挖掘过程可能会生成属性内频繁项集,这会大大降低其效率。为解决此问题,我们提出了一种改进的FP-Growth算法,称为RFP-Growth,以避免生成属性内频繁项集,并进一步提高其效率,并通过附加修剪策略实现其MapReduce版本。对该算法进行了综合和真实库数据测试,实验结果表明该算法优于现有算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号