首页> 外文会议>Advances in knowledge discovery and data mining >Multiple Seeds Based Evolutionary Algorithm for Mining Boolean Association Rules
【24h】

Multiple Seeds Based Evolutionary Algorithm for Mining Boolean Association Rules

机译:挖掘布尔关联规则的基于多种子的进化算法

获取原文
获取原文并翻译 | 示例

摘要

Most of the association rule mining algorithms use a single seed for initializing a population without paying attention to the effectiveness of an initial population in an evolutionary learning. Recently, researchers show that an initial population has significant effects on producing good solutions over several generations of a genetic algorithm. There are two significant challenges raised by single seed based genetic algorithms for real world applications: (1) solutions of a genetic algorithm are varied, since different seeds generate different initial populations, (2) it is a hard process to define an effective seed for a specific application. To avoid these problems, in this paper we propose a new multiple seeds based genetic algorithm (MSGA) which generates multiple seeds from different domains of a solution space to discover high quality rules from a large data set. This approach introduces m-domain model and m-seeds selection process through which the whole solution space is subdivided into m-number of same size domains and from each domain it selects a seed. By using these seeds, this method generates an effective initial population to perform an evolutionary learning of the fitness value of each rule. As a result, this method obtains strong searching efficiency at the beginning of the evolution and achieves fast convergence along with the evolution. MSGA is tested with different mutation and crossover operators for mining interesting Boolean association rules from different real world data sets and compared the results with different single seeds based genetic algorithms.
机译:大多数关联规则挖掘算法使用单个种子来初始化种群,而不关注进化学习中初始种群的有效性。最近,研究人员表明,初始种群对遗传算法几代人产生良好解决方案具有重要影响。基于单种子的遗传算法在现实世界中的应用提出了两个重大挑战:(1)遗传算法的解决方案是多种多样的,因为不同的种子会产生不同的初始种群,(2)为一个特定的应用程序。为了避免这些问题,在本文中,我们提出了一种新的基于多种子的遗传算法(MSGA),该算法从解决方案空间的不同域生成多个种子,以从大型数据集中发现高质量规则。这种方法引入了m域模型和m种子选择过程,通过该过程将整个解决方案空间细分为m个大小相同的域,并从每个域中选择一个种子。通过使用这些种子,此方法生成有效的初始种群,以对每个规则的适用性值进行进化学习。结果,该方法在演化开始时获得了强大的搜索效率,并且随着演化而实现了快速收敛。使用不同的变异和交叉算子对MSGA进行了测试,以从不同的现实世界数据集中挖掘有趣的布尔关联规则,并将结果与​​基于不同种子的遗传算法进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号