首页> 外文会议>2015 IEEE Seventh International Conference on Intelligent Computing and Information Systems >Efficient breast cancer detection using sequential feature selection techniques
【24h】

Efficient breast cancer detection using sequential feature selection techniques

机译:使用顺序特征选择技术的高效乳腺癌检测

获取原文
获取原文并翻译 | 示例

摘要

Breast cancer is one of the most dangerous cancers in the world especially in the Arab countries and Egypt. Due to the large spreading of the disease, automatic recognition systems can help physicians to classify the tumors as benign or malignant. However, performing a lot of pathological analysis consumes time and money. In this paper, we propose an algorithm for decreasing the number of features required to detect the tumor. Two classifiers are chosen to test the classification accuracy; linear and quadratic. The experimental results show that, there are strong correlations between the features in the data set. When using the sequential feature selection algorithm, results show that, discarding more than 50% of the features has no significant loss on classification accuracy when using the quadratic discriminate classifier. Additionally, only four PCA components can be used with the same accuracy as using nine components when being classified by the linear discriminate classifier. Additionally, the outliers in the data set have no notable effect on the classification accuracy. The data set is proved to be homogenous using the k-means clustering algorithm.
机译:乳腺癌是世界上最危险的癌症之一,尤其是在阿拉伯国家和埃及。由于疾病的广泛传播,自动识别系统可以帮助医生将肿瘤分类为良性或恶性。但是,进行大量病理分析会浪费时间和金钱。在本文中,我们提出了一种用于减少检测肿瘤所需特征数量的算法。选择两个分类器以测试分类准确性;线性和二次方。实验结果表明,数据集中的特征之间存在很强的相关性。结果表明,当使用顺序特征选择算法时,使用二次判别分类器时,丢弃超过50%的特征不会对分类精度造成重大损失。另外,在通过线性区分分类器进行分类时,只能使用与使用九个组件相同的精度来使用四个PCA组件。此外,数据集中的异常值对分类准确性没有显着影响。使用k均值聚类算法证明数据集是同质的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号