首页> 外文期刊>BMC Bioinformatics >A sparse autoencoder-based deep neural network for protein solvent accessibility and contact number prediction
【24h】

A sparse autoencoder-based deep neural network for protein solvent accessibility and contact number prediction

机译:基于稀疏自动编码器的深度神经网络,用于蛋白质溶剂的可及性和接触数预测

获取原文
           

摘要

Direct prediction of the three-dimensional (3D) structures of proteins from one-dimensional (1D) sequences is a challenging problem. Significant structural characteristics such as solvent accessibility and contact number are essential for deriving restrains in modeling protein folding and protein 3D structure. Thus, accurately predicting these features is a critical step for 3D protein structure building. In this study, we present DeepSacon, a computational method that can effectively predict protein solvent accessibility and contact number by using a deep neural network, which is built based on stacked autoencoder and a dropout method. The results demonstrate that our proposed DeepSacon achieves a significant improvement in the prediction quality compared with the state-of-the-art methods. We obtain 0.70 three-state accuracy for solvent accessibility, 0.33 15-state accuracy and 0.74 Pearson Correlation Coefficient (PCC) for the contact number on the 5729 monomeric soluble globular protein dataset. We also evaluate the performance on the CASP11 benchmark dataset, DeepSacon achieves 0.68 three-state accuracy and 0.69 PCC for solvent accessibility and contact number, respectively. We have shown that DeepSacon can reliably predict solvent accessibility and contact number with stacked sparse autoencoder and a dropout approach.
机译:从一维(1D)序列直接预测蛋白质的三维(3D)结构是一个具有挑战性的问题。重要的结构特征(例如溶剂可及性和接触数)对于推导对蛋白质折叠和蛋白质3D结构建模的约束至关重要。因此,准确预测这些特征是3D蛋白质结构构建的关键步骤。在这项研究中,我们提出了DeepSacon,这是一种可通过使用深度神经网络有效预测蛋白质溶剂可及性和接触数的计算方法,该方法基于堆叠式自动编码器和辍学方法而构建。结果表明,与最新方法相比,我们提出的DeepSacon大大提高了预测质量。在5729个单体可溶性球状蛋白数据集上,我们获得的接触状态三态准确度为0.70,十五种状态准确度为0.33,皮尔逊相关系数(PCC)为0.74。我们还评估了CASP11基准数据集的性能,DeepSacon的溶剂可及性和接触数分别达到0.68三态精度和0.69 PCC。我们已经证明,DeepSacon可以使用堆叠式稀疏自动编码器和压差方法可靠地预测溶剂的可及性和接触数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号