首页> 美国卫生研究院文献>other >A Sparse Marker Extension Tree Algorithm for Selecting the Best Set of Haplotype Tagging Single Nucleotide Polymorphisms
【2h】

A Sparse Marker Extension Tree Algorithm for Selecting the Best Set of Haplotype Tagging Single Nucleotide Polymorphisms

机译:选择单倍型标记单核苷酸多态性最佳组的稀疏标记扩展树算法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Single nucleotide polymorphisms (SNPs) play a central role in the identification of susceptibility genes for common diseases. Recent empirical studies on human genome have revealed block-like structures, and each block contains a set of haplotype tagging SNPs (htSNPs) that capture a large fraction of the haplotype diversity. Herein, we present an innovative sparse marker extension tree (SMET) algorithm to select optimal htSNP set(s). SMET reduces the search space considerably (compared to full enumeration strategy), therefore improves computing efficiency. We tested this algorithm on several datasets at three different genomic scales: (1) gene-wide (NOS3, CRP, IL6 PPARA, and TNF), (2) region-wide (a Whitehead Institute's inflammatory bowel disease dataset and a UK Graves’ disease dataset), and (3) chromosome-wide (chromosome 22) levels. SMET offers geneticists with greater flexibilities in SNP tagging than lossless methods with adjustable haplotype diversity coverage (ϕ). In simulation studies, we found that (1) an initial sample size of 50 individuals (100 chromosomes) or more is needed for htSNP selection; (2) the SNP tagging strategy is considerably more efficient when the underlying block structure is taken into account; and (3) htSNP sets at 80−90% ϕ are more cost-effective than the lossless sets in term of statistical power, relative risk ratio estimation, and genotyping efforts. Our study suggests that the novel SMET algorithm is a valuable tool for association tests.
机译:单核苷酸多态性(SNPs)在常见疾病易感基因的鉴定中起着核心作用。关于人类基因组的最新实证研究已经揭示了块状结构,并且每个块包含一组捕获大部分单倍型多样性的单倍型标记SNP(htSNP)。在这里,我们提出了一种创新的稀疏标记扩展树(SMET)算法,以选择最佳的htSNP集。 SMET大大减少了搜索空间(与完整的枚举策略相比),因此提高了计算效率。我们在三种不同基因组规模的几个数据集上测试了该算法:(1)全基因(NOS3,CRP,IL6 PPARA和TNF),(2)全地区(怀特海德研究所的炎症性肠病数据集和UK Graves'疾病数据集),以及(3)整个染色体(22号染色体)水平。与无损方法(具有可调节的单倍型多样性覆盖率)相比,SMET为遗传学家提供了更大的灵活性,可用于SNP标记。在模拟研究中,我们发现(1)htSNP选择需要50个个体(100个染色体)或更多的初始样本大小; (2)当考虑到底层的块结构时,SNP标记策略的效率会大大提高; (3)在统计能力,相对风险比估计和基因分型方面,htSNP设置为80-90%than比无损设置更具成本效益。我们的研究表明,新颖的SMET算法是用于关联测试的有价值的工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号