...
首页> 外文期刊>Amino acids >Combination use of protein-protein interaction network topological features improves the predictive scores of deleterious non-synonymous single-nucleotide polymorphisms
【24h】

Combination use of protein-protein interaction network topological features improves the predictive scores of deleterious non-synonymous single-nucleotide polymorphisms

机译:蛋白质-蛋白质相互作用网络拓扑特征的组合使用可提高有害的非同义单核苷酸多态性的预测得分

获取原文
获取原文并翻译 | 示例
           

摘要

Single-nucleotide polymorphisms (SNPs) are the most frequent form of genetic variations. Non-synonymous SNPs (nsSNPs) occurring in coding region result in single amino acid substitutions that associate with human hereditary diseases. Plenty of approaches were designed for distinguishing deleterious from neutral nsSNPs based on sequence level information. Novel in this work, combinations of protein-protein interaction (PPI) network topological features were introduced in predicting disease-related nsSNPs. Based on a dataset that was compiled from Swiss-Prot, a random forest model was constructed with an average accuracy value of 80.43 % and an MCC value of 0.60 in a rigorous tenfold crossvalidation test. For an independent dataset, our model achieved an accuracy of 88.05 % and an MCC of 0.67. Compared with previous studies, our approach presented superior prediction ability. Results showed that the incorporated PPI network topological features outperform conventional features. Our further analysis indicated that disease-related proteins are topologically different from other proteins. This study suggested that nsSNPs may share some topological information of proteins and the change of topological attributes could provide clues in illustrating functional shift due to nsSNPs.
机译:单核苷酸多态性(SNP)是遗传变异的最常见形式。在编码区中出现的非同义SNP(nsSNP)会导致与人类遗传性疾病相关的单个氨基酸取代。设计了许多基于序列水平信息来区分中性nsSNP和有害nsSNP的方法。在这项工作中的新颖之处在于,引入了蛋白质-蛋白质相互作用(PPI)网络拓扑特征的组合来预测与疾病相关的nsSNP。根据Swiss-Prot编译的数据集,在严格的十倍交叉验证测试中,构建了随机森林模型,其平均准确度值为80.43%,MCC值为0.60。对于独立的数据集,我们的模型实现了88.05%的准确度和0.67的MCC。与以前的研究相比,我们的方法具有更好的预测能力。结果表明,合并的PPI网络拓扑特征优于常规特征。我们的进一步分析表明,疾病相关蛋白在拓扑结构上不同于其他蛋白。这项研究表明nsSNP可能共享蛋白质的某些拓扑信息,并且拓扑属性的变化可能为说明nsSNP引起的功能转移提供线索。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号