...
首页> 外文期刊>Journal of Biophysical Chemistry >MetalloPred: A tool for hierarchical prediction of metal ion binding proteins using cluster of neural networks and sequence derived features
【24h】

MetalloPred: A tool for hierarchical prediction of metal ion binding proteins using cluster of neural networks and sequence derived features

机译:MetalloPred:使用神经网络簇和序列衍生特征对金属离子结合蛋白进行层次预测的工具

获取原文
           

摘要

Given a protein sequence, how can we identify whether it is a metalloprotein or not? If it is, which main functional class and subclasses it belongs to? This is an important biological question because they are closely related to the biological function of an uncharacterized protein. Particularly, with the avalanche of protein sequences generated in the post genomic era and since conventional techniques are time consuming and expensive, it is highly desirable to develop an automated method by which one can get a fast and accurate answer to these questions. Here, a top-down predictor, called MetalloPred, is developed which consists of 3 level of hierarchical classification using cascade of neural networks from sequence derived features. The 1st layer of the prediction engine is for identifying a query protein as metalloprotein or not; the 2nd layer for the main functional class; and the 3rd layer for the sub-functional class. The overall success rates for all the three layers are higher than 60% that were obtained through rigorous cross-validation tests on the very stringent benchmark datasets in which none of the proteins has 30% sequence identity with any other in the same class or subclass. MetalloPred achieved good prediction accuracies and could nicely complement experimental approaches for identification of metal binding proteins. MetalloPred is freely available to be used in-house as a standalone and is accessible at http://www.juit.ac.in/assets/Metallopred/.
机译:给定一个蛋白质序列,我们如何鉴定它是否是金属蛋白?如果是,它属于哪个主要功能类和子类?这是一个重要的生物学问题,因为它们与未表征蛋白的生物学功能密切相关。特别地,随着在后基因组时代产生的大量蛋白质序列,并且由于常规技术既费时又昂贵,因此非常需要开发一种自动化方法,通过该方法可以快速而准确地回答这些问题。在这里,开发了一种名为MetalloPred的自上而下的预测器,该预测器由3级层次分类组成,其中使用了来自序列衍生特征的级联神经网络。预测引擎的第一层用于识别查询蛋白是否为金属蛋白;主要功能类别的第二层;子功能类的第3层。通过对非常严格的基准数据集进行严格的交叉验证测试获得的三层结果的总体成功率均高于60%,在严格的基准数据集中,没有一种蛋白与同一类或亚类的任何其他蛋白具有30%的序列同一性。 MetalloPred获得了良好的预测准确性,可以很好地补充用于鉴定金属结合蛋白的实验方法。 MetalloPred是免费提供的,可以在内部独立使用,也可以从http://www.juit.ac.in/assets/Metallopred/访问。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号