首页> 美国卫生研究院文献>Algorithms for Molecular Biology : AMB >Unrooted unordered homeomorphic subtree alignment of RNA trees
【2h】

Unrooted unordered homeomorphic subtree alignment of RNA trees

机译:RNA树的无根无序同胚子树比对

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We generalize some current approaches for RNA tree alignment, which are traditionally confined to ordered rooted mappings, to also consider unordered unrooted mappings. We define the Homeomorphic Subtree Alignment problem (HSA), and present a new algorithm which applies to several modes, combining global or local, ordered or unordered, and rooted or unrooted tree alignments. Our algorithm generalizes previous algorithms that either solved the problem in an asymmetric manner, or were restricted to the rooted and/or ordered cases. Focusing here on the most general unrooted unordered case, we show that for input trees T and S, our algorithm has an O(nTnS + min(dT,dS)LTLS) time complexity, where nT,LT and dT are the number of nodes, the number of leaves, and the maximum node degree in T, respectively (satisfying dT ≤ LT ≤ nT), and similarly for nS,LS and dS with respect to the tree S. This improves the time complexity of previous algorithms for less general variants of the problem.In order to obtain this time bound for HSA, we developed new algorithms for a generalized variant of the Min-Cost Bipartite Matching problem (MCM), as well as to two derivatives of this problem, entitled All-Cavity-MCM and All-Pairs-Cavity-MCM. For two input sets of size n and m, where n ≤ m, MCM and both its cavity derivatives are solved in O(n3 + nm) time, without the usage of priority queues (e.g. Fibonacci heaps) or other complex data structures. This gives the first cubic time algorithm for All-Pairs-Cavity-MCM, and improves the running times of MCM and All-Cavity-MCM problems in the unbalanced case where n ≪ m.We implemented the algorithm (in all modes mentioned above) as a graphical software tool which computes and displays similarities between secondary structures of RNA given as input, and employed it to a preliminary experiment in which we ran all-against-all inter-family pairwise alignments of RNAse P and Hammerhead RNA family members, exposing new similarities which could not be detected by the traditional rooted ordered alignment approaches. The results demonstrate that our approach can be used to expose structural similarity between some RNAs with higher sensitivity than the traditional rooted ordered alignment approaches. Source code and web-interface for our tool can be found in .
机译:我们概括了一些当前用于RNA树比对的方法,这些方法传统上局限于有序的根映射,也要考虑无序的无根映射。我们定义了同胚子树对齐问题(HSA),并提出了一种适用于多种模式的新算法,将全局或局部,有序或无序,有根或无根的树对齐方式组合在一起。我们的算法概括了以前的算法,这些算法要么以非对称方式解决了问题,要么被限制在有根和/或有序情况下。在这里集中于最普通的无根无序情况,我们表明对于输入树T和S,我们的算法具有O(nTnS + min(dT,dS)L T L S )时间复杂度,其中 n T L T d T 分别是 T 中的节点数,叶子数和最大节点度(满足 d T L T n T ),对于 n S L S d S 到树 S 。这为问题的较不通用变体提高了先前算法的时间复杂度。为了获得HSA的时间限制,我们针对 Min-Cost Bipartite Matching 问题的广义变体开发了新算法( MCM ),以及该问题的两个派生词,分别是 All-Cavity-MCM All-Pairs-Cavity-MCM 。对于两个大小为 n m 的输入集,其中 n m MCM 及其两个腔导数都在 O n 3 + n m )时间,而无需使用优先级队列(例如Fibonacci堆)或其他复杂的数据结构。这为 All - Pairs -腔体- MCM 提供了第一个立方时间算法,并缩短了在 n ≪的不平衡情况下, MCM All -- MCM 问题 m 。我们将算法(在上述所有模式下)作为一种图形软件工具实施,该工具可以计算并显示作为输入的RNA二级结构之间的相似性,并将其用于初步实验中RNA酶P和Hammerhead RNA家族成员的所有家族之间的成对比对,揭示了传统的有根有序比对方法无法检测到的新相似性。结果表明,与传统的根有序-比对方法相比,我们的方法可用于更高灵敏度地揭示某些RNA之间的结构相似性。可以在中找到我们工具的源代码和Web界面。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号