首页> 外文期刊>International Journal of Engineering Trends and Technology >Analysis of Feature Selection Algorithms and a Comparative study on Heterogeneous Classifier for High Dimensional Data survey
【24h】

Analysis of Feature Selection Algorithms and a Comparative study on Heterogeneous Classifier for High Dimensional Data survey

机译:高维数据调查的特征选择算法分析和异构分类器的比较研究

获取原文
           

摘要

This paper focuses on the analysis of various feature selection algorithms and a comparative study on heterogeneous classifier predictive accuracy problems to work with high dimensional data. Especially we conduct experimental comparisons of IBK (KNN), SVM, NBTree and J48 on KDD Cup99 intrusion detection dataset and one cancer disease diagnosis microarray datasets and analysis their performance with vote generalizations. Based on the fact a large number of features can cause a noise of data and degrades a performance of learning algorithm.To tackle these problems identifying a suitable feature selection method is essential for a given machine learning algorithm tasks. So feature selection plays a great role in intrusion detection, bioinformatics, and medical data analysis. Thus this paper deals the application of best feature selection techniques to improve learning algorithm predictive accuracy in microarray dataset and KDD (Knowledge Discovery and Data Mining Tools Conference) Cup 99 dataset with a respective classification and feature selection algorithms. basically, this approach shows the application of feature selection algorithms when a large number of features represented in a small sample data and small numbers of features represented with a high number of samples by taking the above two different datasets.
机译:本文着重分析各种特征选择算法,并比较研究适用于高维数据的异构分类器预测精度问题。特别是,我们在KDD Cup99入侵检测数据集和一个癌症疾病诊断微阵列数据集上进行了IBK(KNN),SVM,NBTree和J48的实验比较,并通过投票概括分析了它们的性能。基于这样的事实,大量特征会导致数据噪声并降低学习算法的性能。为解决这些问题,对于给定的机器学习算法任务,确定合适的特征选择方法至关重要。因此,特征选择在入侵检测,生物信息学和医学数据分析中起着重要作用。因此,本文探讨了最佳特征选择技术的应用,以提高微阵列数据集和KDD(知识发现和数据挖掘工具会议)Cup Cup 99数据集中具有各自分类和特征选择算法的学习算法的预测准确性。基本上,此方法通过采用上述两个不同的数据集,在小样本数据中表示大量特征而小样本数据中表示少量特征时显示了特征选择算法的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号