...
首页> 外文期刊>Computational Biology and Bioinformatics, IEEE/ACM Transactions on >Reliable and Fast Estimation of Recombination Rates by Convergence Diagnosis and Parallel Markov Chain Monte Carlo
【24h】

Reliable and Fast Estimation of Recombination Rates by Convergence Diagnosis and Parallel Markov Chain Monte Carlo

机译:通过收敛诊断和并行马尔可夫链蒙特卡洛方法可靠,快速地估算重组率

获取原文
获取原文并翻译 | 示例
           

摘要

Genetic recombination is an essential event during the process of meiosis resulting in an exchange of segments between paired chromosomes. Estimating recombination rate is crucial for understanding the process of recombination. Experimental methods are normally difficult and limited to small scale estimations. Thus statistical methods using population genetics data are important for large-scale analysis. LDhat is an extensively used statistical method using rjMCMC algorithm to predict recombination rates. Due to the complexity of rjMCMC scheme, LDhat may take a long time for large SNP data sets. In addition, rjMCMC parameters should be manually defined in the original program which directly impact results. To address these issues, we designed an improved algorithm based on LDhat implementing MCMC convergence diagnostic algorithms to automatically predict values of parameters and monitor the mixing process. Then parallel computation methods were employed to further accelerate the new program. The new algorithms have been tested on ten samples from HapMap phase 2 data set. The results were compared with previous code and showed nearly identical output. However, our new methods achieved significant acceleration proving that they are more efficient and reliable for the estimation of recombination rates. The stand-alone package is freely available for download http://www.ntu.edu.sg/home/zhengjie/software/CPLDhat.
机译:遗传重组是减数分裂过程中的重要事件,导致配对染色体之间的片段交换。估计重组率对于理解重组过程至关重要。实验方法通常很困难,并且仅限于小规模估算。因此,使用种群遗传学数据的统计方法对于大规模分析很重要。 LDhat是使用rjMCMC算法预测重组率的广泛使用的统计方法。由于rjMCMC方案的复杂性,对于大型SNP数据集,LDhat可能会花费很长时间。此外,应该在原始程序中手动定义rjMCMC参数,这些参数会直接影响结果。为了解决这些问题,我们设计了一种基于LDhat的改进算法,该算法实现了MCMC收敛诊断算法,可以自动预测参数值并监控混合过程。然后采用并行计算方法来进一步加速新程序。新算法已经在HapMap第二阶段数据集中的十个样本上进行了测试。将结果与以前的代码进行比较,并显示几乎相同的输出。但是,我们的新方法获得了显着的加速,证明了它们对于重组率的估计更加有效和可靠。该独立软件包可免费下载http://www.ntu.edu.sg/home/zhengjie/software/CPLDhat。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号