...
首页> 外文期刊>BMC Structural Biology >Sequence and structural features of carbohydrate binding in proteins and assessment of predictability using a neural network
【24h】

Sequence and structural features of carbohydrate binding in proteins and assessment of predictability using a neural network

机译:蛋白质中碳水化合物结合的序列和结构特征以及使用神经网络评估可预测性

获取原文
           

摘要

Background Protein-Carbohydrate interactions are crucial in many biological processes with implications to drug targeting and gene expression. Nature of protein-carbohydrate interactions may be studied at individual residue level by analyzing local sequence and structure environments in binding regions in comparison to non-binding regions, which provide an inherent control for such analyses. With an ultimate aim of predicting binding sites from sequence and structure, overall statistics of binding regions needs to be compiled. Sequence-based predictions of binding sites have been successfully applied to DNA-binding proteins in our earlier works. We aim to apply similar analysis to carbohydrate binding proteins. However, due to a relatively much smaller region of proteins taking part in such interactions, the methodology and results are significantly different. A comparison of protein-carbohydrate complexes has also been made with other protein-ligand complexes. Results We have compiled statistics of amino acid compositions in binding versus non-binding regions- general as well as in each different secondary structure conformation. Binding propensities of each of the 20 residue types and their structure features such as solvent accessibility, packing density and secondary structure have been calculated to assess their predisposition to carbohydrate interactions. Finally, evolutionary profiles of amino acid sequences have been used to predict binding sites using a neural network. Another set of neural networks was trained using information from single sequences and the prediction performance from the evolutionary profiles and single sequences were compared. Best of the neural network based prediction could achieve an 87% sensitivity of prediction at 23% specificity for all carbohydrate-binding sites, using evolutionary information. Single sequences gave 68% sensitivity and 55% specificity for the same data set. Sensitivity and specificity for a limited galactose binding data set were obtained as 63% and 79% respectively for evolutionary information and 62% and 68% sensitivity and specificity for single sequences. Propensity and other sequence and structural features of carbohydrate binding sites have also been compared with our similar extensive studies on DNA-binding proteins and also with protein-ligand complexes. Conclusion Carbohydrates typically show a preference to bind aromatic residues and most prominently tryptophan. Higher exposed surface area of binding sites indicates a role of hydrophobic interactions. Neural networks give a moderate success of prediction, which is expected to improve when structures of more protein-carbohydrate complexes become available in future.
机译:背景技术蛋白质与碳水化合物的相互作用在许多生物学过程中至关重要,这涉及药物靶向和基因表达。通过与非结合区相比分析结合区中的局部序列和结构环境,可以在单个残基水平上研究蛋白质与碳水化合物相互作用的性质,这为此类分析提供了内在的控制。为了从序列和结构预测结合位点的最终目的,需要编制结合区的总体统计数据。基于序列的结合位点预测已成功应用于我们早期工作中的DNA结合蛋白。我们旨在将类似的分析应用于碳水化合物结合蛋白。但是,由于参与这种相互作用的蛋白质区域相对较小,因此方法和结果明显不同。还已经将蛋白质-碳水化合物复合物与其他蛋白质-配体复合物进行了比较。结果我们汇编了结合区和非结合区的氨基酸组成的统计数据,以及一般的以及每个不同的二级结构构象。已经计算了20种残基类型中每一种的结合倾向及其结构特征(例如溶剂可及性,堆积密度和二级结构),以评估其对碳水化合物相互作用的易感性。最后,氨基酸序列的进化谱已用于使用神经网络预测结合位点。使用来自单个序列的信息训练了另一组神经网络,并比较了进化谱和单个序列的预测性能。基于进化网络的信息,基于神经网络的最佳预测可以对所有碳水化合物结合位点以23%的特异性实现87%的预测敏感性。对于同一数据集,单个序列的灵敏度为68%,特异性为55%。有限的半乳糖结合数据集的敏感性和特异性对于进化信息分别为63%和79%,对于单个序列的敏感性和特异性分别为62%和68%。碳水化合物结合位点的倾向性和其他序列及结构特征也已与我们对DNA结合蛋白以及蛋白-配体复合物的类似广泛研究进行了比较。结论碳水化合物通常表现出优先结合芳香残基和最主要的色氨酸的能力。结合位点的较高暴露表面积表明疏水相互作用的作用。神经网络的预测取得了一定程度的成功,预计将来会出现更多蛋白质-碳水化合物复合物的结构时,这种预测会有所改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号