首页> 外文会议>Asia-Pacific Bioinformatics Conference >COMPUTING THE ASSIGNMENT OF ORTHOLOGOUS GENES VIA GENOME REARRANGEMENT
【24h】

COMPUTING THE ASSIGNMENT OF ORTHOLOGOUS GENES VIA GENOME REARRANGEMENT

机译:通过基因组重排计算正交基因的分配

获取原文

摘要

The assignment of orthologous genes between a pair of genomes is a fundamental and challenging problem in comparative genomics. Existing methods that assign orthologs based on the similarity between DNA or protein sequences may make erroneous assignments when sequence similarity does not clearly delineate the evolutionary relationship among genes of the same families. In this paper, we present a new approach to ortholog assignment that takes into account both sequence similarity and evolutionary events at genome level, where orthologous genes are assumed to correspond to each other in the most parsimonious evolving scenario under genome rearrangement. It is then formulated as a problem of computing the signed reversal distance with duplicates between two genomes of interest, for which an efficient heuristic algorithm was given by introducing two new optimization problems, minimum common partition and maximum cycle decomposition. Following this approach, we have implemented a high-throughput systemfor assigning orthologs on a genome scale, called SOAR, and tested it on both simulated data and real genome sequence data. Compared to a recent ortholog assignment method based entirely on homology search (called INPARANOID), SOAR shows a marginally better performance in terms of sensitivity on the real data set because it was able to identify several, correct orthologous pairs that were missed by INPARANOID. The simulation results demonstrate that SOAR in genera] performs better than the iterated exemplar algorithm in terms of computing the reversal distance and assigning correct orthologs.
机译:一对基因组之间的正非基因的分配是比较基因组学中的基本和挑战性问题。当序列相似性没有明确描绘同一家庭的基因之间的进化关系时,将基于DNA或蛋白质序列之间的相似性分配外科的现有方法可能会产生错误的任务。在本文中,我们提出了一种新的前端分配方法,所述直接分配考虑到基因组水平的序列相似性和进化事件,其中在基因组重新排列下的最令人奇异的演化场景中,假设正交基因彼此对应。然后将其制定为计算符号逆转距离的问题,其中两个感兴趣的两个基因组之间的重复,通过引入两个新的优化问题,最小的公共分区和最大循环分解来给出高效的启发式算法。在这种方法之后,我们已经实现了一种在基因组规模上分配外翻的高吞吐量系统,称为SOAR,并在模拟数据和真实基因组序列数据上测试它。与最近基于同源性搜索(称为inparanoid)的最近的初始分配方法相比,SOAR在真实数据集的敏感性方面显示了略微更好的性能,因为它能够识别互相遗传的几个,正确的正交成对。仿真结果表明,在计算逆转距离和分配正确的正轨时,它们在Genera中的飙升比迭代的示例算法更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号