首页> 外文期刊>Bioinformatics >Likelihood-based complex trait association testing for arbitrary depth sequencing data
【24h】

Likelihood-based complex trait association testing for arbitrary depth sequencing data

机译:基于似然性的复杂性状关联测试,用于任意深度测序数据

获取原文
获取原文并翻译 | 示例
           

摘要

In next generation sequencing (NGS)-based genetic studies, researchers typically perform genotype calling first and then apply standard genotype-based methods for association testing. However, such a two-step approach ignores genotype calling uncertainty in the association testing step and may incur power loss and/or inflated type-I error. In the recent literature, a few robust and efficient likelihood based methods including both likelihood ratio test (LRT) and score test have been proposed to carry out association testing without intermediate genotype calling. These methods take genotype calling uncertainty into account by directly incorporating genotype likelihood function (GLF) of NGS data into association analysis. However, existing LRT methods are computationally demanding or do not allow covariate adjustment; while existing score tests are not applicable to markers with low minor allele frequency (MAF). We provide an LRT allowing flexible covariate adjustment, develop a statistically more powerful score test and propose a combination strategy (UNC combo) to leverage the advantages of both tests. We have carried out extensive simulations to evaluate the performance of our proposed LRT and score test. Simulations and real data analysis demonstrate the advantages of our proposed combination strategy: it offers a satisfactory trade-off in terms of computational efficiency, applicability (accommodating both common variants and variants with low MAF) and statistical power, particularly for the analysis of quantitative trait where the power gain can be up to similar to 60% when the causal variant is of low frequency (MAF 0.01).
机译:在基于下一代测序(NGS)的遗传研究中,研究人员通常会先执行基因型调用,然后再应用基于标准基因型的方法进行关联测试。但是,这种两步方法忽略了关联测试步骤中基因型调用的不确定性,并可能导致功率损耗和/或I型错误增大。在最近的文献中,已经提出了一些鲁棒且有效的基于似然性的方法,包括似然比检验(LRT)和得分检验,以进行关联检验而无需中间基因型调用。这些方法通过将NGS数据的基因型似然函数(GLF)直接纳入关联分析来考虑基因型调用不确定性。但是,现有的LRT方法对计算要求很高,或者不允许进行协变量调整。而现有的分数测试不适用于低等位基因频率(MAF)的标记。我们提供了一种LRT,可以灵活地进行协变量调整,开发统计上更强大的得分测试,并提出一种组合策略(UNC组合)以利用两种测试的优势。我们已经进行了广泛的模拟,以评估我们提出的轻轨和分数测试的性能。仿真和真实数据分析证明了我们提出的组合策略的优点:在计算效率,适用性(适应常见变体和具有低MAF的变体)和统计能力方面,它提供了令人满意的折衷,尤其是对于定量性状的分析当因果变量为低频时(MAF <0.01),功率增益可高达60%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号