首页> 外文期刊>Bioinformatics >The Treeterbi and Parallel Treeterbi algorithms: efficient, optimal decoding for ordinary, generalized and pair HMMs
【24h】

The Treeterbi and Parallel Treeterbi algorithms: efficient, optimal decoding for ordinary, generalized and pair HMMs

机译:Treeterbi和并行Treeterbi算法:针对普通HMM,通用HMM和成对HMM的高效,最佳解码

获取原文
获取原文并翻译 | 示例
           

摘要

Motivation: Hidden Markov models (HMMs) and generalized HMMs been successfully applied to many problems, but the standard Viterbi algorithm for computing the most probable interpretation of an input sequence (known as decoding) requires memory proportional to the length of the sequence, which can be prohibitive. Existing approaches to reducing memory usage either sacrifice optimality or trade increased running time for reduced memory. Results: We developed two novel decoding algorithms, Treeterbi and Parallel Treeterbi, and implemented them in the TWINSCAN/N-SCAN gene-prediction system. The worst case asymptotic space and time are the same as for standard Viterbi, but in practice, Treeterbi optimally decodes arbitrarily long sequences with generalized HMMs in bounded memory without increasing running time. Parallel Treeterbi uses the same ideas to split optimal decoding across processors, dividing latency to completion by approximately the number of available processors with constant average overhead per processor. Using these algorithms, we were able to optimally decode all human chromosomes with N-SCAN, which increased its accuracy relative to heuristic solutions. We also implemented Treeterbi for Pairagon, our pair HMM based cDNA-to-genome aligner. Availability: The TWINSCAN/N-SCAN/PAIRAGON open source software package is available from http://genes.cse.wustl.edu. Contact: brent@cse.wustl.edu.
机译:动机:隐马尔可夫模型(HMM)和广义HMM已成功应用于许多问题,但是用于计算输入序列最可能的解释(称为解码)的标准Viterbi算法需要与序列长度成比例的内存。禁止。现有的减少内存使用的方法要么牺牲了最优性,要么以增加运行时间为代价来减少内存。结果:我们开发了两种新颖的解码算法Treeterbi和Parallel Treeterbi,并在TWINSCAN / N-SCAN基因预测系统中实现了它们。最坏情况下的渐近空间和时间与标准Viterbi相同,但是在实践中,Treeterbi使用有限内存中的广义HMM最佳地解码任意长序列,而不会增加运行时间。并行Treeterbi使用相同的思想在处理器之间分配最佳解码,将完成的等待时间除以可用处理器的数量,每个处理器的平均平均开销不变。使用这些算法,我们能够使用N-SCAN对所有人类染色体进行最佳解码,从而相对于启发式解决方案提高了其准确性。我们还为Pairagon(基于HMM的cDNA至基因组比对仪)实施Treeterbi。可用性:TWINSCAN / N-SCAN / PAIRAGON开源软件包可从http://genes.cse.wustl.edu获得。联系人:brent@cse.wustl.edu。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号