首页> 美国卫生研究院文献>Heredity >A robust multiple-locus method for quantitative trait locus analysis of non-normally distributed multiple traits
【2h】

A robust multiple-locus method for quantitative trait locus analysis of non-normally distributed multiple traits

机译:用于非正态分布多个性状数量性状基因座分析的鲁棒多位点方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Linear regression-based quantitative trait loci/association mapping methods such as least squares commonly assume normality of residuals. In genetics studies of plants or animals, some quantitative traits may not follow normal distribution because the data include outlying observations or data that are collected from multiple sources, and in such cases the normal regression methods may lose some statistical power to detect quantitative trait loci. In this work, we propose a robust multiple-locus regression approach for analyzing multiple quantitative traits without normality assumption. In our method, the objective function is least absolute deviation (LAD), which corresponds to the assumption of multivariate Laplace distributed residual errors. This distribution has heavier tails than the normal distribution. In addition, we adopt a group LASSO penalty to produce shrinkage estimation of the marker effects and to describe the genetic correlation among phenotypes. Our LAD-LASSO approach is less sensitive to the outliers and is more appropriate for the analysis of data with skewedly distributed phenotypes. Another application of our robust approach is on missing phenotype problem in multiple-trait analysis, where the missing phenotype items can simply be filled with some extreme values, and be treated as outliers. The efficiency of the LAD-LASSO approach is illustrated on both simulated and real data sets.
机译:基于线性回归的定量特征位点/关联映射方法(例如最小二乘法)通常假定残差为正态。在动植物的遗传学研究中,某些定量性状可能不遵循正态分布,因为数据包括离群的观测值或从多个来源收集的数据,在这种情况下,正常回归方法可能会失去检测定量性状基因座的统计能力。在这项工作中,我们提出了一种鲁棒的多位点回归方法,可以在没有正态假设的情况下分析多个定量特征。在我们的方法中,目标函数是最小绝对偏差(LAD),它对应于多元拉普拉斯分布残差误差的假设。此分布的尾部比正态分布重。此外,我们采用组LASSO罚分法来产生标记效应的收缩估计并描述表型之间的遗传相关性。我们的LAD-LASSO方法对异常值较不敏感,并且更适合用于分析表型分布不均的数据。我们的鲁棒方法的另一个应用是在多特征分析中的缺失表型问题上,其中缺失表型项可以简单地用一些极值填充,并被视为离群值。 LAD-LASSO方法的效率在模拟和真实数据集上都有说明。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号