A Computational Efficient Algorithm for Protein Sequence Classification


Abstract

In this paper we present statistical algorithms that classify the stability of proteins from their sequences. A protein sequence consists of successive amino acid codes and can be treated as multivariate categorical data. Weights are calculated from a statistical variance analysis of the data set in each group (stable or unstable proteins); they provide an important clue to the effect of each combination of amino acid codes on protein stability. Once the weights for every combination of amino acid codes have been determined, each protein can be assigned a score representing its stability. The score distribution for stable proteins differs from that for unstable proteins. The algorithm is well suited to analyzing protein stability from sequence. We propose weighting algorithms and compare their protein stability classification results. The approach provides an alternative method for protein stability classification and a predicted result that can serve as a reference before a protein is mutated.
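
The abstract does not give the weighting formula, so the following is only a minimal sketch of the general idea (weights per amino acid combination, then a per-protein score), not the authors' method. It assumes, purely for illustration, that a "combination" is a pair of adjacent residues and that a weight is the difference in mean pair frequency between the two groups divided by the pooled standard deviation. The names `pair_frequencies`, `fit_weights`, and `score` are hypothetical.

```python
from collections import Counter
from statistics import mean, pvariance


def pair_frequencies(seq):
    """Relative frequency of each adjacent amino acid pair in a sequence."""
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    total = len(pairs)
    if total == 0:
        return {}
    return {p: c / total for p, c in Counter(pairs).items()}


def fit_weights(stable, unstable, eps=1e-6):
    """Assumed weighting (not from the paper): for each pair, the difference
    of group mean frequencies divided by the pooled standard deviation.
    Both groups must be non-empty."""
    fs = [pair_frequencies(s) for s in stable]
    fu = [pair_frequencies(s) for s in unstable]
    all_pairs = set().union(*fs, *fu)  # every pair seen in either group
    weights = {}
    for p in all_pairs:
        xs = [f.get(p, 0.0) for f in fs]
        xu = [f.get(p, 0.0) for f in fu]
        pooled = (pvariance(xs) + pvariance(xu)) / 2
        # eps keeps the division finite when a pair has zero variance
        weights[p] = (mean(xs) - mean(xu)) / (pooled + eps) ** 0.5
    return weights


def score(seq, weights):
    """Stability score: weighted sum of the sequence's pair frequencies.
    Pairs never seen during fitting contribute zero."""
    return sum(weights.get(p, 0.0) * f
               for p, f in pair_frequencies(seq).items())
```

Under these assumptions, a positive score leans toward the stable group and a negative score toward the unstable group. A decision threshold would in practice be chosen from the two score distributions on labeled data.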


Bibliographic record

Source
2003 Nanotechnology Conference and Trade Show Nanotech 2003 Vol.1 Feb 23-27, 2003 California, USA | 2003 | pp. 24-27 | 4 pages
Conference location: San Francisco, CA (US)
Authors
Yiming Li; Hsiao-Mei Lu
Affiliation

National Nano Device Laboratories Microelectronics and Information Systems Research Center, National Chiao Tung University P.O. Box 25-178, Hsinchu 300, Taiwan;

Original format: PDF
Language: English
CLC classification: General industrial technology
Keywords
protein stability; classification of protein sequence; prediction model; statistical analysis; computational statistics;

