首页> 外文期刊>Journal of Bioinformatics and Computational Biology >DESIGN AND ANALYSIS OF AN EFFICIENT RECURSIVE LINKING ALGORITHM FOR CONSTRUCTING LIKELIHOOD BASED GENETIC MAPS FOR A LARGE NUMBER OF MARKERS
【24h】

DESIGN AND ANALYSIS OF AN EFFICIENT RECURSIVE LINKING ALGORITHM FOR CONSTRUCTING LIKELIHOOD BASED GENETIC MAPS FOR A LARGE NUMBER OF MARKERS

机译:一种有效的递归链接算法的构建,用于基于标记的大量基因的遗传图谱

获取原文
获取原文并翻译 | 示例
           

摘要

A multi-locus likelihood of a genetic map is computed based on a mathematical model of chromatid exchange in meiosis that accounts for any type of bivalent configuration in a genetic interval in any specified order of genetic markers. The computational problem is to calculate the likelihood (L) and maximize L by choosing an ordering of genetic markers on the map and the recombination distances between markers. This maximum likelihood estimate (MLE) could be found either with a straightforward algorithm or with the proposed recursive linking algorithm that implements the likelihood computation process involving an iterative procedure is called Expectation Maximization (EM). The time complexity of the straightforward algorithm is exponential without bound in the number of genetic markers, and implementation of the model with a straightforward algorithm for more than seven genetic markers is not feasible, thus motivating the critical importance of the proposed recursive linking algorithm. The recursive linking algorithm decomposes the pool of genetic markers into segments and renders the model implementable for hundreds of genetic markers. The recursive algorithm is shown to reduce the order of time complexity from exponential to linear in the number of markers. The improvement in time complexity is shown theoretically by a worst-case analysis of the algorithm and supported by run time results using data on linkage group-II of the fungal genome Neurospora crassa.
机译:基于减数分裂中染色单体交换的数学模型计算遗传图谱的多位点可能性,该数学模型解释了遗传区间中遗传标志中任何指定顺序的任何类型的二价构型。计算问题是通过选择图上遗传标记的顺序和标记之间的重组距离来计算似然(L)并使L最大化。可以通过简单的算法或提议的递归链接算法找到这种最大似然估计(MLE),该算法实现涉及迭代过程的似然计算过程,称为期望最大化(EM)。简单算法的时间复杂度是成指数的,并且不受遗传标记数量的限制,并且无法使用针对七个以上遗传标记的简单算法来实现模型,因此激发了所提出的递归链接算法的至关重要性。递归链接算法将遗传标记库分解为片段,并使该模型可用于数百种遗传标记。所示的递归算法可将时间复杂度的顺序从标记数减少到指数级。理论上最差情况的分析显示了时间复杂度的提高,并且使用真菌基因组Neurospora crassa的连接基团II上的数据在运行时结果中得到了支持。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号