首页> 外文会议> >Protein-Protein Interaction Prediction Based on Sequence Data by Support Vector Machine with Probability Assignment
【24h】

Protein-Protein Interaction Prediction Based on Sequence Data by Support Vector Machine with Probability Assignment

机译:基于序列数据的概率分配支持向量机的蛋白质-蛋白质相互作用预测

获取原文

摘要

In this paper, we investigate the sequence-based protein-protein interaction prediction by machine learning methods. Specifically, we propose to build classifiers in the space of domain pairs, which are purely based on sequence data. We designed a novel way to select negative samples using a classification-based iterative voting procedure, and systematically compared the effects of negative sample selection on the performance of classification. We also propose an approach to estimate the probabilities for the predictions by SVM. Based on the selected negative samples, we compared nonlinear SVM based on gaussian kernel, linear SVM and linear logistic regression for both classification performance and probability assignments. Our results show that the probability assigned by SVM is more natural than logistic regression, and SVM also outperforms logistic regression for prediction.
机译:在本文中,我们研究了通过机器学习方法进行的基于序列的蛋白质-蛋白质相互作用预测。具体来说,我们建议在域对的空间中构建分类器,这些分类器完全基于序列数据。我们设计了一种使用基于分类的迭代投票程序选择否定样本的新颖方法,并系统地比较了否定样本选择对分类性能的影响。我们还提出了一种通过SVM估计预测概率的方法。基于选择的负样本,我们比较了基于高斯核的非线性SVM,线性SVM和线性Logistic回归的分类性能和概率分配。我们的结果表明,SVM分配的概率比逻辑回归更自然,并且SVM在预测方面也胜过逻辑回归。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号