首页> 外文期刊>Journal of information technology research >Novel PSSM-Based Approaches for Gene Identification Using Support Vector Machine
【24h】

Novel PSSM-Based Approaches for Gene Identification Using Support Vector Machine

机译:基于PSSM的基因识别方法使用支持向量机

获取原文
获取原文并翻译 | 示例
           

摘要

By understanding the function of each protein encoded in genome, the molecular mechanism of the cell can be recognized. In genome annotation field, several methods or techniques have been developed to locate or predict the patterns of genes in genome sequence. However, recognizing corresponding gene of a given protein sequence using conventional tools is inherently complicated and error prone. This paper first focuses on the issue of gene prediction and its challenges. The authors then present a novel method for identifying genes that involves a two-step process. First the research presents new features extracted from protein sequences using a position specific scoring matrix (PSSM). The PSSM profiles are converted into uniform numeric representation. Then, a new structured approach has been applied on PSSM vector which uses a decision tree-based technique for obtaining rules. Finally, the rules of single class are joined together to form a matrix which is then given as an input to SVM for classification purpose. The rules derived from algorithm correspond to genes. The authors also introduce another approach for predicting genes based on PSSM using SVM. Both the methods have been implemented on genome DNAset dataset. Empirical evaluation shows that PSSM based SAFARI approach produces better results.
机译:通过了解基因组中编码的每种蛋白质的功能,可以识别细胞的分子机制。在基因组注释场中,已经开发了几种方法或技术以定位或预测基因组序列中基因的模式。然而,使用常规工具识别给定蛋白质序列的相应基因是固有的复杂和误差。本文首先侧重于基因预测问题及其挑战。然后提出了一种用于识别涉及两步过程的基因的新方法。首先,研究呈现了使用位置特异性评分矩阵(PSSM)从蛋白质序列中提取的新特征。 PSSM配置文件转换为统一的数字表示。然后,在PSSM向量上应用了一种新的结构化方法,该方法使用基于决策树的技术来获取规则。最后,单类的规则连接在一起以形成矩阵,然后将其作为输入到SVM的输入,以进行分类目的。源自算法的规则对应于基因。作者还介绍了一种使用SVM基于PSSM预测基因的另一种方法。这两种方法都在基因组DNASET数据集上实现。实证评估表明,基于PSSM的野生动物园方法会产生更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号