首页> 外文期刊>BMC Bioinformatics >Prediction of peptides binding to MHC class I and II alleles by temporal motif mining
【24h】

Prediction of peptides binding to MHC class I and II alleles by temporal motif mining

机译:通过时间基序挖掘预测与MHC I和II类MHC等位基因结合的肽

获取原文
           

摘要

BackgroundMHC (Major Histocompatibility Complex) is a key player in the immune response of most vertebrates. The computational prediction of whether a given antigenic peptide will bind to a specific MHC allele is important in the development of vaccines for emerging pathogens, the creation of possibilities for controlling immune response, and for the applications of immunotherapy. One of the problems that make this computational prediction difficult is the detection of the binding core region in peptides, coupled with the presence of bulges and loops causing variations in the total sequence length. Most machine learning methods require the sequences to be of the same length to successfully discover the binding motifs, ignoring the length variance in both motif mining and prediction steps. In order to overcome this limitation, we propose the use of time-based motif mining methods that work position-independently.ResultsThe prediction method was tested on a benchmark set of 28 different alleles for MHC class I and 27 different alleles for MHC class II. The obtained results are comparable to the state of the art methods for both MHC classes, surpassing the published results for some alleles. The average prediction AUC values are 0.897 for class I, and 0.858 for class II.ConclusionsTemporal motif mining using partial periodic patterns can capture information about the sequences well enough to predict the binding of the peptides and is comparable to state of the art methods in the literature. Unlike neural networks or matrix based predictors, our proposed method does not depend on peptide length and can work with both short and long fragments. This advantage allows better use of the available training data and the prediction of peptides of uncommon lengths.
机译:背景MHC(主要组织相容性复合体)是大多数脊椎动物免疫反应的关键因素。给定抗原性肽是否会结合特定的MHC等位基因的计算预测对于新兴病原体疫苗的开发,控制免疫应答的可能性以及免疫疗法的应用非常重要。使得该计算预测困难的问题之一是检测肽中的结合核心区域,以及存在凸起和环,导致总序列长度的变化。大多数机器学习方法要求序列具有相同的长度才能成功发现结合基序,而忽略了基序挖掘和预测步骤中的长度差异。为了克服此限制,我们建议使用基于时间的基序挖掘方法,这些方法独立于位置运行。结果在MHC I类的28个不同等位基因和II MHC的27个不同等位基因的基准集上测试了预测方法。对于两个MHC类,所获得的结果均与现有技术水平相当,超过了某些等位基因的已发表结果。对于I类,平均预测AUC值为0.897,对于II类,平均预测AUC值为0.858。结论使用部分周期性模式的时态基序挖掘可以很好地捕获有关序列的信息,以预测肽的结合,并且与现有技术水平相当。文学。与基于神经网络或基于矩阵的预测变量不同,我们提出的方法不依赖于肽段长度,并且可以用于短片段和长片段。该优点允许更好地利用可用的训练数据和预测不常见长度的肽。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号