首页> 美国卫生研究院文献>BMC Genomics >Dynamic association rules for gene expression data analysis
【2h】

Dynamic association rules for gene expression data analysis

机译:基因表达数据分析的动态关联规则

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

BackgroundThe purpose of gene expression analysis is to look for the association between regulation of gene expression levels and phenotypic variations. This association based on gene expression profile has been used to determine whether the induction/repression of genes correspond to phenotypic variations including cell regulations, clinical diagnoses and drug development. Statistical analyses on microarray data have been developed to resolve gene selection issue. However, these methods do not inform us of causality between genes and phenotypes. In this paper, we propose the dynamic association rule algorithm (DAR algorithm) which helps ones to efficiently select a subset of significant genes for subsequent analysis. The DAR algorithm is based on association rules from market basket analysis in marketing. We first propose a statistical way, based on constructing a one-sided confidence interval and hypothesis testing, to determine if an association rule is meaningful. Based on the proposed statistical method, we then developed the DAR algorithm for gene expression data analysis. The method was applied to analyze four microarray datasets and one Next Generation Sequencing (NGS) dataset: the Mice Apo A1 dataset, the whole genome expression dataset of mouse embryonic stem cells, expression profiling of the bone marrow of Leukemia patients, Microarray Quality Control (MAQC) data set and the RNA-seq dataset of a mouse genomic imprinting study. A comparison of the proposed method with the t-test on the expression profiling of the bone marrow of Leukemia patients was conducted.
机译:背景基因表达分析的目的是寻找基因表达水平的调节与表型变异之间的关联。基于基因表达谱的这种关联已被用于确定基因的诱导/抑制是否对应于表型变异,包括细胞调节,临床诊断和药物开发。已经开发了对微阵列数据的统计分析以解决基因选择问题。但是,这些方法不能告诉我们基因和表型之间的因果关系。在本文中,我们提出了动态关联规则算法(DAR算法),它可以帮助人们有效地选择重要基因的子集进行后续分析。 DAR算法基于市场营销中市场篮子分析的关联规则。我们首先提出一种统计方法,该方法基于构造单侧置信区间和假设检验来确定关联规则是否有意义。基于提出的统计方法,我们然后开发了用于基因表达数据分析的DAR算法。该方法用于分析四个微阵列数据集和一个下一代测序(NGS)数据集:小鼠Apo A1数据集,小鼠胚胎干细胞的全基因组表达数据集,白血病患者骨髓的表达谱,微阵列质量控制( MAQC)数据集和小鼠基因组印迹研究的RNA-seq数据集。将该方法与t检验对白血病患者骨髓的表达谱进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号