首页> 美国卫生研究院文献>PLoS Clinical Trials >Prediction of Thermostability from Amino Acid Attributes by Combination of Clustering with Attribute Weighting: A New Vista in Engineering Enzymes
【2h】

Prediction of Thermostability from Amino Acid Attributes by Combination of Clustering with Attribute Weighting: A New Vista in Engineering Enzymes

机译:通过聚类与属性权重相结合从氨基酸属性预测热稳定性:工程酶领域的新视野

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The engineering of thermostable enzymes is receiving increased attention. The paper, detergent, and biofuel industries, in particular, seek to use environmentally friendly enzymes instead of toxic chlorine chemicals. Enzymes typically function at temperatures below 60°C and denature if exposed to higher temperatures. In contrast, a small portion of enzymes can withstand higher temperatures as a result of various structural adaptations. Understanding the protein attributes that are involved in this adaptation is the first step toward engineering thermostable enzymes. We employed various supervised and unsupervised machine learning algorithms as well as attribute weighting approaches to find amino acid composition attributes that contribute to enzyme thermostability. Specifically, we compared two groups of enzymes: mesostable and thermostable enzymes. Furthermore, a combination of attribute weighting with supervised and unsupervised clustering algorithms was used for prediction and modelling of protein thermostability from amino acid composition properties. Mining a large number of protein sequences (2090) through a variety of machine learning algorithms, which were based on the analysis of more than 800 amino acid attributes, increased the accuracy of this study. Moreover, these models were successful in predicting thermostability from the primary structure of proteins. The results showed that expectation maximization clustering in combination with uncertainly and correlation attribute weighting algorithms can effectively (100%) classify thermostable and mesostable proteins. Seventy per cent of the weighting methods selected Gln content and frequency of hydrophilic residues as the most important protein attributes. On the dipeptide level, the frequency of Asn-Glu was the key factor in distinguishing mesostable from thermostable enzymes. This study demonstrates the feasibility of predicting thermostability irrespective of sequence similarity and will serve as a basis for engineering thermostable enzymes in the laboratory.
机译:热稳定酶的工程设计正受到越来越多的关注。尤其是造纸,清洁剂和生物燃料行业,寻求使用环保的酶代替有毒的氯化学物质。酶通常在低于60°C的温度下起作用,如果暴露于较高温度下会变性。相反,由于各种结构调整,一小部分酶可以承受更高的温度。了解这种适应过程所涉及的蛋白质属性是工程化热稳定酶的第一步。我们采用了各种有监督和无监督的机器学习算法,以及属性加权方法来寻找有助于酶热稳定性的氨基酸组成属性。具体来说,我们比较了两种酶:可降解酶和热稳定酶。此外,将属性加权与有监督和无监督聚类算法相结合,可以根据氨基酸组成特性对蛋白质的热稳定性进行预测和建模。通过对基于800多种氨基酸属性进行分析的各种机器学习算法来挖掘大量蛋白质序列(2090),提高了这项研究的准确性。而且,这些模型成功地从蛋白质的一级结构预测了热稳定性。结果表明,期望最大化聚类与不确定性和相关属性加权算法相结合可以有效地(100%)对热稳定和可降解的蛋白质进行分类。 70%的加权方法选择Gln含量和亲水残基的频率作为最重要的蛋白质属性。在二肽水平上,Asn-Glu的频率是区分可降解的酶和热稳定的酶的关键因素。这项研究证明了预测热稳定性的可行性,而与序列的相似性无关,并将作为实验室中工程化热稳定酶的基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号