首页> 外文期刊>Bioinformatics >Haplotype inference for present-absent genotype data using previously identified haplotypes and haplotype patterns
【24h】

Haplotype inference for present-absent genotype data using previously identified haplotypes and haplotype patterns

机译:使用先前确定的单倍型和单倍型模式对当前基因型数据进行单倍型推断

获取原文
获取原文并翻译 | 示例
           

摘要

Motivation: Killer immunoglobulin-like receptor (KIR) genes vary considerably in their presence or absence on a specific regional haplotype. Because presence or absence of these genes is largely detected using locus-specific genotyping technology, the distinction between homozygosity and hemizygosity is often ambiguous. The performance of methods for haplotype inference (e.g. PL-EM, PHASE) for KIR genes may be compromised due to the large portion of ambiguous data. At the same time, many haplotypes or partial haplotype patterns have been previously identified and can be incorporated to facilitate haplotype inference for unphased genotype data. To accommodate the increased ambiguity of present-absent genotyping of KIR genes, we developed a hybrid approach combining a greedy algorithm with the Expectation-Maximization (EM) method for haplotype inference based on previously identified haplotypes and haplotype patterns.Results: We implemented this algorithm in a software package named HAPLO-IHP (Haplotype inference using identified haplotype patterns) and compared its performance with that of HAPLORE and PHASE on simulated KIR genotypes. We compared five measures in order to evaluate the reliability of haplotype assignments and the accuracy in estimating haplotype frequency. Our method outperformed the two existing techniques by all five measures when either 60% or 25% of previously identified haplotypes were incorporated into the analyses.
机译:动机:杀伤性免疫球蛋白样受体(KIR)基因在特定区域单倍型的存在与否之间差异很大。因为使用基因座特异性基因分型技术可以很大程度上检测到这些基因的存在与否,所以纯合和半合之间的区别通常是模棱两可的。 KIR基因的单倍型推论方法(例如PL-EM,PHASE)的性能可能会因为数据的歧义很大而受到影响。同时,先前已经鉴定了许多单倍型或部分单倍型模式,可以将其合并以促进针对未定相基因型数据的单倍型推断。为了适应目前不存在的KIR基因分型的不确定性,我们开发了一种混合方法,该方法将贪婪算法与期望最大化(EM)方法相结合,用于基于先前确定的单倍型和单倍型模式的单倍型推断。结果:我们实现了该算法在名为HAPLO-IHP的软件包中(使用已识别的单倍型模式进行单倍型推断),并将其与HAPLORE和PHASE在模拟KIR基因型上的性能进行了比较。为了评估单元型分配的可靠性和估计单元型频率的准确性,我们比较了五种测量方法。当将先前确定的单倍型中的60%或25%纳入分析时,我们的方法在所有五项指标上均优于两种现有技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号