首页> 外文期刊>Pattern recognition letters >Feature selection by maximizing correlation information for integrated high-dimensional protein data
【24h】

Feature selection by maximizing correlation information for integrated high-dimensional protein data

机译:通过最大化关联信息以集成高维蛋白质数据进行特征选择

获取原文
获取原文并翻译 | 示例
           

摘要

This paper primarily proposes a novel method, named maximum correlation information (MCI), to evaluate the importance of each feature by maximizing correlation information between feature space and class coding space. Then the proposed MCI is combined with the strategy of recursive feature elimination to form a new feature selection method, MCI-based recursive feature elimination (MCI-RFE). MCI-RFE aims to select the optimal features with lower time complexity. To validate the performance of the proposed method, random 10-fold cross-validation is applied thirty times on six widely used benchmark datasets with integrated features from position-specific score matrix (PSSM), PROFEAT and Gene Ontology (GO). The experimental results show that MCI-RFE is highly competitive and works effectively for integrated high-dimensional protein data compared to the other state-of-the-art algorithms including SVM-RFE, ReliefF-RFE and Random-Forest. (C) 2017 Elsevier B.V. All rights reserved.
机译:本文主要提出一种称为最大相关信息(MCI)的新方法,通过最大化特征空间和类编码空间之间的相关信息来评估每个特征的重要性。然后将提出的MCI与递归特征消除策略相结合,形成了一种新的特征选择方法,即基于MCI的递归特征消除(MCI-RFE)。 MCI-RFE旨在选择时间复杂度较低的最佳功能。为了验证所提出方法的性能,对六个广泛使用的基准数据集进行了十次随机十倍交叉验证,该数据具有来自特定位置评分矩阵(PSSM),PROFEAT和基因本体论(GO)的集成功能。实验结果表明,与其他最先进的算法(包括SVM-RFE,ReliefF-RFE和Random-Forest)相比,MCI-RFE具有很高的竞争力,并且可以有效地处理集成的高维蛋白质数据。 (C)2017 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Pattern recognition letters》 |2017年第1期|17-24|共8页
  • 作者单位

    Xiamen Univ, Dept Automat, Xiamen 361005, Fujian, Peoples R China|York Univ, Sch Informat Technol, Toronto, ON M3J 1P3, Canada;

    York Univ, Sch Informat Technol, Toronto, ON M3J 1P3, Canada;

    Xiamen Univ, Dept Automat, Xiamen 361005, Fujian, Peoples R China|York Univ, Sch Informat Technol, Toronto, ON M3J 1P3, Canada;

    Xiamen Univ, Dept Automat, Xiamen 361005, Fujian, Peoples R China|Xiamen Univ, Innovat Ctr Cell Signaling Network, Xiamen 361102, Fujian, Peoples R China|Natl Ctr Healthcare Big Data, Xiamen Res Inst, Xiamen 361005, Fujian, Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Feature selection; Recursive feature elimination (RFE); Support vector machine (SVM); ReliefF; Random-Forest;

    机译:特征选择;递归特征消除(RFE);支持向量机(SVM);ReliefF;Random-Forest;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号