首页> 外文会议>International Symposium on Current Progress in Mathematics and Sciences >Correlated based SVM-RFE as feature selection for cancer classification using microarray databases
【24h】

Correlated based SVM-RFE as feature selection for cancer classification using microarray databases

机译:基于相关的SVM-RFE作为使用微阵列数据库的癌症分类的特征选择

获取原文

摘要

A lot of research about cancer dataset classification has been done to decrease the number of death caused by cancer. Cancer microarray dataset consists of a large number of features that if we use all of them will spend time, cost, and memory capacities. It necessary to reduce the number of features using feature selection. We need to choose a feature selection method that not only eliminate the irrelevant features, but also consider the existence of correlated genes. If we ignore the correlated genes, it will lead to the disappearance of important information about cancer itself. To prove that feature selection will give higher accuracy, this research will compare the accuracy between classification of datasets without feature selection and with feature selection. This research use CSVM-RFE as feature selection method. To classify, this research use SVM and KFCM with two different kernel types, that is Gaussian RBF Kernel with σ = 0.05 and Polynomial Kernel with degree = 3. Those methods are applied on three different cancer datasets. As a result, highest accuracy of colon cancer dataset is 98.6 % using SVM based RBF Kernel. Highest accuracy of prostate cancer dataset is 99.2 % using SVM based polynomial kernel, and highest accuracy of lymphoma cancer dataset is 99.1 % using SVM based RBF kernel.
机译:已经进行了大量关于癌症数据集分类的研究以减少癌症引起的死亡人数。癌症微阵列数据集包括大量功能,如果我们使用所有这些都将花费时间,成本和内存容量。有必要使用特征选择来减少功能的数量。我们需要选择一个特征选择方法,不仅消除了无关的特征,而且还考虑存在相关基因的存在。如果我们忽略相关基因,它将导致关于癌症本身的重要信息的消失。为了证明特征选择将提供更高的准确性,该研究将比较数据集分类之间的准确性,而无需特征选择和特征选择。本研究使用CSVM-RFE作为特征选择方法。要对此进行分类,本研究使用具有两种不同内核类型的SVM和KFCM,即高斯RBF内核,具有σ= 0.05和具有度的多项式内核= 3.这些方法应用于三种不同的癌症数据集。结果,使用基于SVM的RBF内核的结肠癌数据集的最高精度为98.6%。前列腺癌数据集的最高精度使用基于SVM的多项式内核为99.2%,并且使用基于SVM的RBF内核的淋巴瘤癌数据集的最高精度为99.1%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号