首页> 外文期刊>Systematic Biology >Cophylogeny Reconstruction via an Approximate Bayesian Computation
【24h】

Cophylogeny Reconstruction via an Approximate Bayesian Computation

机译:通过近似贝叶斯计算进行系统发生重建

获取原文
获取原文并翻译 | 示例
           

摘要

Despite an increasingly vast literature on cophylogenetic reconstructions for studying host-parasite associations, understanding the common evolutionary history of such systems remains a problem that is far from being solved. Most algorithms for host-parasite reconciliation use an event-based model, where the events include in general (a subset of) cospeciation, duplication, loss, and host switch. All known parsimonious event-based methods then assign a cost to each type of event in order to find a reconstruction of minimum cost. The main problem with this approach is that the cost of the events strongly influences the reconciliation obtained. Some earlier approaches attempt to avoid this problem by finding a Pareto set of solutions and hence by considering event costs under some minimization constraints. To deal with this problem, we developed an algorithm, called Coala, for estimating the frequency of the events based on an approximate Bayesian computation approach. The benefits of this method are 2-fold: (i) it provides more confidence in the set of costs to be used in a reconciliation, and (ii) it allows estimation of the frequency of the events in cases where the data set consists of trees with a large number of taxa. We evaluate our method on simulated and on biological data sets. We show that in both cases, for the same pair of host and parasite trees, different sets of frequencies for the events lead to equally probable solutions. Moreover, often these solutions differ greatly in terms of the number of inferred events. It appears crucial to take this into account before attempting any further biological interpretation of such reconciliations. More generally, we also show that the set of frequencies can vary widely depending on the input host and parasite trees. Indiscriminately applying a standard vector of costs may thus not be a good strategy.
机译:尽管关于用于研究寄主-寄生虫关联的系统进化重建的文献越来越多,但是了解这种系统的共同进化史仍然是一个尚未解决的问题。大多数用于宿主-寄生虫对帐的算法都使用基于事件的模型,其中事件通常包括共同指定,复制,丢失和宿主切换(的子集)。然后,所有已知的基于简约事件的方法都将成本分配给每种类型的事件,以便找到最小成本的重构。这种方法的主要问题是事件的成本强烈影响获得的对帐。一些较早的方法试图通过找到一组Pareto解决方案并因此通过在某些最小化约束下考虑事件成本来避免此问题。为了解决这个问题,我们开发了一种称为Coala的算法,用于基于近似贝叶斯计算方法来估计事件的频率。这种方法的好处有两方面:(i)在对帐中使用的一组成本中提供了更多的信心,并且(ii)在数据集包括以下内容的情况下,可以估计事件的发生频率有大量分类单元的树木。我们根据模拟和生物学数据集评估我们的方法。我们表明,在两种情况下,对于同一对宿主树和寄生树,事件的不同频率集导致同样可能的解决方案。而且,这些解决方案通常在推断事件的数量方面有很大差异。在尝试对这种和解进行任何进一步的生物学解释之前,考虑到这一点似乎至关重要。更一般地说,我们还表明,频率集可以根据输入宿主和寄生树而变化很大。因此,不加选择地使用标准成本向量可能不是一个好的策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号