首页> 外文会议>2003 Nanotechnology Conference and Trade Show Nanotech 2003 Vol.1 Feb 23-27, 2003 California, USA >A Computational Efficient Algorithm for Protein Sequence Classification
【24h】

A Computational Efficient Algorithm for Protein Sequence Classification

机译:一种蛋白质序列分类的高效计算算法

获取原文
获取原文并翻译 | 示例

摘要

In this paper we present statistical algorithms to classify the stability of proteins by their sequence. A protein sequence consists of successive amino acid codes and can be considered as multivariate categorical data. Based on the statistical variance analysis for data set in each group (stable or unstable protein), the weights are calculated and become an important clue for the effects of the combination of amino acids codes on protein stability. Once the weights for every combination of amino acid codes have been decided, we can assign each protein a score presenting its stability. The distribution of the score for a stable protein is different from the score of an unstable protein. Our algorithm is well suit in the protein stability analysis by its sequence. We propose weighting algorithms and compare them as the results of protein stability classification. It provides an alternative for the protein stability classification and a predictable result as the reference before the protein mutation.
机译:在本文中,我们提出了统计算法,可以根据蛋白质序列的稳定性对蛋白质进行分类。蛋白质序列由连续的氨基酸代码组成,可以视为多元分类数据。基于每个组(稳定或不稳定蛋白质)中数据集的统计方差分析,计算权重并成为氨基酸代码组合对蛋白质稳定性影响的重要线索。一旦确定了每种氨基酸代码组合的权重,我们就可以为每种蛋白质分配一个表示其稳定性的分数。稳定蛋白的分数分布与不稳定蛋白的分数不同。我们的算法通过其序列非常适合蛋白质稳定性分析。我们提出加权算法,并将其作为蛋白质稳定性分类的结果进行比较。它提供了蛋白质稳定性分类的替代方法,并提供了可预测的结果,作为蛋白质突变前的参考。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号