...
首页> 外文期刊>The Korean journal of chemical engineering >A Gene Clustering Method with Masking Cross-matching Fragments Using Modified Suffix Tree Clustering Method
【24h】

A Gene Clustering Method with Masking Cross-matching Fragments Using Modified Suffix Tree Clustering Method

机译:基于修饰后缀树聚类的具有交叉匹配片段的基因聚类方法

获取原文
获取原文并翻译 | 示例
           

摘要

Multiple sequence alignment is a method for comparing two or more DNA or protein sequences. Most multiple sequence alignment methods rely on pairwise alignment and Smith-Waterman algorithm [Needleman and Wunsch, 1970; Smith and Waterman, 1981] to generate an alignment hierarchy. Therefore, as the number of sequences increases, the runtime increases exponentially. To resolve this problem, this paper presents a multiple sequence alignment method using a parallel processing suffix tree algorithm to search for common subsequences at one time without pairwise alignment. The cross-matched subsequences among the searched common subsequences may be generated and those cause inexact-matching. So the procedure of masking cross-matching pairs was suggested in this study. The proposed method, improved STC (Suffix Tree Clustering), is summarized as follows: (1) construction of suffix tree; (2) search and overlap of common subsequences; (3) grouping of subsequence pairs; (4) masking of cross-matching pairs; and (5) clustering of gene sequences. The new method was successfully evaluated with 23 genes in Mus musculus and 22 genes in three species, clustering nine and eight clusters, respectively.
机译:多序列比对是比较两个或多个DNA或蛋白质序列的方法。大多数多重序列比对方法依赖于成对比对和Smith-Waterman算法[Needleman and Wunsch,1970; Smith and Waterman,1981]生成路线层次。因此,随着序列数量的增加,运行时间呈指数增长。为了解决这个问题,本文提出了一种多序列比对方法,该方法使用并行处理后缀树算法来一次搜索公共子序列而无需成对比对。可能会在搜索到的公共子序列中生成交叉匹配的子序列,而这些交叉序列会导致不完全匹配。因此,本研究提出了屏蔽交叉匹配对的步骤。所提出的改进后的STC(后缀树聚类)方法概括如下:(1)后缀树的构造; (2)常见子序列的搜索和重叠; (3)子序列对的分组; (4)交叉匹配对的屏蔽; (5)基因序列的聚类。该新方法已经成功地用小家鼠中的23个基因和三个物种中的22个基因成功地进行了评估,分别聚成9个和8个聚类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号