...
首页> 外文期刊>BMC Bioinformatics >Large-scale integration of cancer microarray data identifies a robust common cancer signature
【24h】

Large-scale integration of cancer microarray data identifies a robust common cancer signature

机译:癌症微阵列数据的大规模整合确定了可靠的常见癌症特征

获取原文
           

摘要

Background There is a continuing need to develop molecular diagnostic tools which complement histopathologic examination to increase the accuracy of cancer diagnosis. DNA microarrays provide a means for measuring gene expression signatures which can then be used as components of genomic-based diagnostic tests to determine the presence of cancer. Results In this study, we collect and integrate ~ 1500 microarray gene expression profiles from 26 published cancer data sets across 21 major human cancer types. We then apply a statistical method, referred to as the T op- S coring P air of G roups (TSPG) classifier, and a repeated random sampling strategy to the integrated training data sets and identify a common cancer signature consisting of 46 genes. These 46 genes are naturally divided into two distinct groups; those in one group are typically expressed less than those in the other group for cancer tissues. Given a new expression profile, the classifier discriminates cancer from normal tissues by ranking the expression values of the 46 genes in the cancer signature and comparing the average ranks of the two groups. This signature is then validated by applying this decision rule to independent test data. Conclusion By combining the TSPG method and repeated random sampling, a robust common cancer signature has been identified from large-scale microarray data integration. Upon further validation, this signature may be useful as a robust and objective diagnostic test for cancer.
机译:背景技术继续需要开发补充组织病理学检查以提高癌症诊断准确性的分子诊断工具。 DNA微阵列提供了一种测量基因表达特征的手段,然后可以将其用作基于基因组的诊断测试的组成部分,以确定癌症的存在。结果在这项研究中,我们收集并整合了来自21种主要人类癌症类型的26种已公开癌症数据集的1500个微阵列基因表达谱。然后,我们将一种统计方法(称为G群的T op-S取心P空气分类法(TSPG)分类器)和一种重复的随机抽样策略应用于综合训练数据集,并确定由46个基因组成的常见癌症特征。这46个基因自然分为两组。对于癌症组织,一组中的表达通常低于另一组中的表达。给定新的表达方式,分类器通过对癌症特征中的46个基因的表达值进行排名并比较两组的平均排名,来将癌症与正常组织区分开。然后通过将此决策规则应用于独立的测试数据来验证此签名。结论通过结合TSPG方法和重复随机抽样,已从大规模微阵列数据集成中确定了可靠的常见癌症特征。经过进一步验证,此签名可用作对癌症进行可靠且客观的诊断测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号