首页> 美国卫生研究院文献>G3: GenesGenomesGenetics >Prediction of Multiple-Trait and Multiple-Environment Genomic Data Using Recommender Systems
【2h】

Prediction of Multiple-Trait and Multiple-Environment Genomic Data Using Recommender Systems

机译:使用推荐系统预测多性状和多环境基因组数据

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In genomic-enabled prediction, the task of improving the accuracy of the prediction of lines in environments is difficult because the available information is generally sparse and usually has low correlations between traits. In current genomic selection, although researchers have a large amount of information and appropriate statistical models to process it, there is still limited computing efficiency to do so. Although some statistical models are usually mathematically elegant, many of them are also computationally inefficient, and they are impractical for many traits, lines, environments, and years because they need to sample from huge normal multivariate distributions. For these reasons, this study explores two recommender systems: item-based collaborative filtering (IBCF) and the matrix factorization algorithm (MF) in the context of multiple traits and multiple environments. The IBCF and MF methods were compared with two conventional methods on simulated and real data. Results of the simulated and real data sets show that the IBCF technique was slightly better in terms of prediction accuracy than the two conventional methods and the MF method when the correlation was moderately high. The IBCF technique is very attractive because it produces good predictions when there is high correlation between items (environment–trait combinations) and its implementation is computationally feasible, which can be useful for plant breeders who deal with very large data sets.
机译:在启用基因组的预测中,提高环境中线的预测准确性的任务是困难的,因为可用信息通常稀疏并且通常在性状之间具有较低的相关性。在当前的基因组选择中,尽管研究人员拥有大量信息和适当的统计模型来进行处理,但这样做的计算效率仍然有限。尽管某些统计模型通常在数学上是优雅的,但其中许多模型的计算效率也很低,并且对于许多特征,品系,环境和年份而言,它们都不切实际,因为它们需要从庞大的正态多元分布中进行抽样。由于这些原因,本研究探索了两个推荐系统:在多个特征和多个环境的情况下,基于项目的协作过滤(IBCF)和矩阵分解算法(MF)。将IBCF和MF方法与两种常规方法进行了模拟和真实数据的比较。模拟和真实数据集的结果表明,在相关度适度较高的情况下,IBCF技术在预测准确性方面比两种常规方法和MF方法稍好。 IBCF技术非常吸引人,因为当项目之间(环境-性状组合)之间具有高度相关性时,它会产生良好的预测,并且它的实现在计算上是可行的,这对于处理非常大的数据集的植物育种者很有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号