首页> 美国卫生研究院文献>Cancer Informatics >A Comparison of Methods for Data-Driven Cancer Outlier Discovery and An Application Scheme to Semisupervised Predictive Biomarker Discovery
【2h】

A Comparison of Methods for Data-Driven Cancer Outlier Discovery and An Application Scheme to Semisupervised Predictive Biomarker Discovery

机译:数据驱动的癌症离体发现方法的比较以及半监督预测生物标志物发现的应用方案

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

A core component in translational cancer research is biomarker discovery using gene expression profiling for clinical tumors. This is often based on cell line experiments; one population is sampled for inference in another. We disclose a semisupervised workflow focusing on binary (switch-like, bimodal) informative genes that are likely cancer relevant, to mitigate this non-statistical problem. Outlier detection is a key enabling technology of the workflow, and aids in identifying the focus genes.We compare outlier detection techniques MOST, LSOSS, COPA, ORT, OS, and t-test, using a publicly available NSCLC dataset. Removing genes with Gaussian distribution is computationally efficient and matches MOST particularly well, while also COPA and OS pick prognostically relevant genes in their top ranks. Also our stability assessment is in favour of both MOST and COPA; the latter does not pair well with prefiltering for non-Gaussianity, but can handle data sets lacking non-cancer cases.We provide R code for replicating our approach or extending it.
机译:翻译癌症研究的核心组成部分是使用针对临床肿瘤的基因表达谱分析发现生物标志物。这通常基于细胞系实验;一个总体被采样以推断另一个。我们公开了一个半监督的工作流程,重点关注可能与癌症相关的二进制(开关样,双峰式)信息基因,以减轻这种非统计问题。离群值检测是工作流程中的关键启用技术,有助于识别关注基因。我们使用公开的NSCLC数据集比较了离群值检测技术MOST,LSOSS,COPA,ORT,OS和t检验。去除具有高斯分布的基因在计算上是有效的,并且与MOST的匹配特别好,而COPA和OS也在其预后方面选择与预后相关的基因。另外,我们的稳定性评估支持MOST和COPA;后者与非高斯性的预过滤不能很好地配对,但是可以处理缺乏非癌症情况的数据集。我们提供了R代码来复制或扩展方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号