首页> 外文期刊>The American Journal of Human Genetics >Imputing Phenotypes for Genome-wide Association Studies
【24h】

Imputing Phenotypes for Genome-wide Association Studies

机译:为全基因组关联研究估算表型

获取原文
获取原文并翻译 | 示例
           

摘要

Genome-wide association studies (GWASs) have been successful in detecting variants correlated with phenotypes of clinical interest. However, the power to detect these variants depends on the number of individuals whose phenotypes are collected, and for phenotypes that are difficult to collect, the sample size might be insufficient to achieve the desired statistical power. The phenotype of interest is often difficult to collect, whereas surrogate phenotypes or related phenotypes are easier to collect and have already been collected in very large samples. This paper demonstrates how we take advantage of these additional related phenotypes to impute the phenotype of interest or target phenotype and then perform association analysis. Our approach leverages the correlation structure between phenotypes to perform the imputation. The correlation structure can be estimated from a smaller complete dataset for which both the target and related phenotypes have been collected. Under some assumptions, the statistical power can be computed analytically given the correlation structure of the phenotypes used in imputation. In addition, our method can impute the summary statistic of the target phenotype as a weighted linear combination of the summary statistics of related phenotypes. Thus, our method is applicable to datasets for which we have access only to summary statistics and not to the raw genotypes. We illustrate our approach by analyzing associated loci to triglycerides (TGs), body mass index (BMI), and systolic blood pressure (SBP) in the Northern Finland Birth Cohort dataset.
机译:全基因组关联研究(GWASs)已成功检测与临床表型相关的变异。但是,检测这些变体的能力取决于收集其表型​​的个体数量,对于难以收集的表型,样本量可能不足以实现所需的统计能力。感兴趣的表型通常很难收集,而替代表型或相关表型更容易收集,并且已经在非常大的样本中收集到。本文演示了我们如何利用这些附加的相关表型来估算目标表型或目标表型,然后执行关联分析。我们的方法利用表型之间的相关结构来执行归因。可以从较小的完整数据集中估算相关结构,为此已经收集了目标表型和相关表型。在某些假设下,可以在推算中使用的表型的相关结构的基础上,通过分析来计算统计功效。此外,我们的方法可以将目标表型的摘要统计量估算为相关表型的摘要统计量的加权线性组合。因此,我们的方法适用于只能访问摘要统计信息而不能访问原始基因型的数据集。我们通过分析与芬兰北部出生队列数据集中的甘油三酸酯(TGs),体重指数(BMI)和收缩压(SBP)相关的基因座来说明我们的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号