首页> 外文学位 >Rough set approach to feature reduction in KDD: Evolutionary computing and data sampling.
【24h】

Rough set approach to feature reduction in KDD: Evolutionary computing and data sampling.

机译:减少KDD中的特征的粗糙集方法:进化计算和数据采样。

获取原文
获取原文并翻译 | 示例

摘要

We develop a framework for parallel computation of rough set decision reducts from data. We adapt the island model for evolutionary computing. The idea is to optimize reducts within separate populations (islands) and enable the best reducts-chromosomes to migrate among islands. Experiments show that the proposed method speeds up calculations and often provides better quality results compared to genetic algorithms applied to date to the feature reduction (aka feature selection or attribute reduction). We continue by extending our genetic algorithm framework onto dynamic reducts, based on random sampling of datasets. Decision rules generated from the best dynamic reducts proved to be stable and highly accurate classifiers. However, calculation of dynamic reducts and corresponding rules is even more expensive in regards of time and memory than in the case of classical rough set decision reducts. In parallel to improving the calculation speed, we investigate our own two extensions of dynamic reducts (called multi-reducts and dynamic semi-reducts), which can be easily optimized using hybrid, order-based genetic algorithms (OGAs). We evaluate our methods with KDD CUP 1999 datasets. The proposed extensions outperform the speed of conventional dynamic reduct calculation while preserving the quality of the resulting classifiers.
机译:我们开发了一个框架,用于并行计算数据的粗糙集决策约简。我们将孤岛模型用于进化计算。这个想法是在单独的种群(岛屿)中优化还原,并使最佳的还原染色体在岛屿之间迁移。实验表明,与迄今应用于特征约简(即特征选择或属性约简)的遗传算法相比,该方法可加快计算速度并通常提供更好的质量结果。我们将基于数据集的随机抽样,将遗传算法框架扩展到动态归约中。最佳动态还原产生的决策规则被证明是稳定且高度准确的分类器。但是,就时间和内存而言,动态约简和相应规则的计算比传统的粗糙集决策约简的成本更高。在提高计算速度的同时,我们研究了动态归约的两个扩展(称为多重归约和动态半归约),可以使用基于顺序的混合遗传算法(OGA)轻松对其进行优化。我们使用KDD CUP 1999数据集评估我们的方法。拟议的扩展性能优于常规动态归约计算的速度,同时保留了所得分类器的质量。

著录项

  • 作者

    Rahman, Mohammad Mahibour.;

  • 作者单位

    The University of Regina (Canada).;

  • 授予单位 The University of Regina (Canada).;
  • 学科 Computer Science.
  • 学位 M.Sc.
  • 年度 2006
  • 页码 102 p.
  • 总页数 102
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号