首页> 外文期刊>Neurocomputing >Effective search for genetic-based machine learning systems via estimation of distribution algorithms and embedded feature reduction techniques
【24h】

Effective search for genetic-based machine learning systems via estimation of distribution algorithms and embedded feature reduction techniques

机译:通过估计分布算法和嵌入式特征约简技术有效搜索基于遗传的机器学习系统

获取原文
获取原文并翻译 | 示例
           

摘要

Genetic-based machine learning (GBML) systems, which employ evolutionary algorithms (EAs) as search mechanisms, evolve rule-based classification models to represent target concepts. Compared to Michigan-style GBML, Pittsburgh-style GBML is expected to achieve more compact solutions. It has been shown that standard recombination operators in EAs do not assure an effective evolutionary search to solve sophisticated problems that contain strong interactions between features. On the other hand, when dealing with real-world classification tasks, irrelevant features not only complicate the problem but also incur unnecessary matchings in GBML systems, which increase the computational cost a lot. To handle the two problems mentioned above in an integrated manner, a new Pittsburgh-style GBML system is proposed. In the proposed method, classifiers are generated and recombined at two levels. At the high level, classifiers are recombined by rule-wise uniform crossover operators since each classifier consists of a variable-size rule set. At the low level, single rules contained in classifiers are reproduced via sampling Bayesian networks that characterize the global statistical information extracted from promising rules found so far. Furthermore, according to the statistical information in the rule population, an embedded approach is presented to detect and remove redundant features incrementally following the evolution of rule population. Results of empirical evaluation show that the proposed method outperforms the original Pittsburgh-style GBML system in terms of classification accuracy while reducing the computational cost. Furthermore, the proposed method is also competitive to other non-evolutionary, highly used machine learning methods. With respect to the performance of feature reduction, the proposed embedded approach is able to deliver solutions with higher classification accuracy when removing the same number of features as other feature reduction techniques do.
机译:基于遗传的机器学习(GBML)系统采用进化算法(EA)作为搜索机制,它进化了基于规则的分类模型来表示目标概念。与密歇根州风格的GBML相比,匹兹堡风格的GBML有望实现更紧凑的解决方案。已经表明,EA中的标准重组算子不能保证有效的进化搜索来解决复杂的问题,这些复杂的问题包含功能之间的强大相互作用。另一方面,在处理现实世界中的分类任务时,不相关的功能不仅使问题复杂化,而且在GBML系统中会引起不必要的匹配,从而大大增加了计算成本。为了综合解决上述两个问题,提出了一种新的匹兹堡式GBML系统。在提出的方法中,分类器被生成并在两个级别上重新组合。在高层,分类器由规则方式统一的交叉运算符重新组合,因为每个分类器都包含一个可变大小的规则集。在低层次上,分类器中包含的单个规则是通过采样贝叶斯网络来再现的,贝叶斯网络表征了从迄今为止发现的有前途的规则中提取的全局统计信息。此外,根据规则总体的统计信息,提出了一种嵌入式方法,用于随着规则总体的发展而逐步检测和去除冗余特征。实证评估结果表明,该方法在分类精度上优于匹兹堡式GBML系统,同时降低了计算成本。此外,所提出的方法还与其他非进化的,高度使用的机器学习方法竞争。关于特征缩减的性能,当与其他特征缩减技术删除相同数量的特征时,所提出的嵌入式方法能够提供具有更高分类精度的解决方案。

著录项

  • 来源
    《Neurocomputing》 |2013年第3期|105-121|共17页
  • 作者

    Jiadong Yang; Hua Xu; Peifa Jia;

  • 作者单位

    Jike.com, Beijing 100020, China;

    State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China;

    State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    genetic-based machine learning systems; estimation of distribution algorithms; features reduction; evolutionary computation;

    机译:基于遗传的机器学习系统;分布算法的估计;特征约简;进化计算;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号