首页> 美国卫生研究院文献>Cambridge Open Access >Comparison of different imputation methods from low- to high-density panels using Chinese Holstein cattle
【2h】

Comparison of different imputation methods from low- to high-density panels using Chinese Holstein cattle

机译:从中国荷斯坦牛的低密度板到高密度板的不同插补方法的比较

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Imputation of high-density genotypes from low- or medium-density platforms is a promising way to enhance the efficiency of whole-genome selection programs at low cost. In this study, we compared the efficiency of three widely used imputation algorithms (fastPHASE, BEAGLE and findhap) using Chinese Holstein cattle with Illumina BovineSNP50 genotypes. A total of 2108 cattle were randomly divided into a reference population and a test population to evaluate the influence of the reference population size. Three bovine chromosomes, BTA1, 16 and 28, were used to represent large, medium and small chromosome size, respectively. We simulated different scenarios by randomly masking 20%, 40%, 80% and 95% single-nucleotide polymorphisms (SNPs) on each chromosome in the test population to mimic different SNP density panels. Illumina Bovine3K and Illumina BovineLD (6909 SNPs) information was also used. We found that the three methods showed comparable accuracy when the proportion of masked SNPs was low. However, the difference became larger when more SNPs were masked. BEAGLE performed the best and was most robust with imputation accuracies >90% in almost all situations. fastPHASE was affected by the proportion of masked SNPs, especially when the masked SNP rate was high. findhap ran the fastest, whereas its accuracies were lower than those of BEAGLE but higher than those of fastPHASE. In addition, enlarging the reference population improved the imputation accuracy for BEAGLE and findhap, but did not affect fastPHASE. Considering imputation accuracy and computational requirements, BEAGLE has been found to be more reliable for imputing genotypes from low- to high-density genotyping platforms.
机译:从低密度或中等密度平台插入高密度基因型是一种有希望以低成本提高全基因组选择程序效率的方法。在这项研究中,我们比较了使用具有Illumina BovineSNP50基因型的中国荷斯坦牛的三种广泛使用的插补算法(fastPHASE,BEAGLE和findhap)的效率。将总共​​2108头牛随机分为参考种群和测试种群,以评估参考种群规模的影响。使用三个牛染色体BTA1、16和28分别代表大,中和小染色体大小。我们通过随机屏蔽测试群体中每个染色体上的20%,40%,80%和95%的单核苷酸多态性(SNP)来模拟不同的SNP密度面板,从而模拟不同的场景。还使用了Illumina Bovine3K和Illumina BovineLD(6909 SNP)信息。我们发现,当被掩盖的SNP的比例较低时,这三种方法显示出相当的准确性。但是,当更多的SNP被掩盖时,差异变得更大。在几乎所有情况下,BEAGLE的插补精度均> 90%,表现最佳,性能最强。 fastPHASE受掩蔽SNP的比例的影响,尤其是当掩蔽SNP的比率较高时。 findhap的运行速度最快,但其精度低于BEAGLE,但高于fastPHASE。此外,增加参考群体可以提高BEAGLE和findhap的插补精度,但不会影响fastPHASE。考虑到插补精度和计算要求,发现BEAGLE对于从低密度基因分型平台到高密度基因分型平台的基因型插值更为可靠。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号