首页> 外文会议>International workshop on database and expert systems applications >Genomic Data Integration: A Case Study on Next Generation Sequencing of Cancer
【24h】

Genomic Data Integration: A Case Study on Next Generation Sequencing of Cancer

机译:基因组数据整合:癌症新一代测序的案例研究

获取原文

摘要

Due to the great advances of Next Generation Sequencing (NGS) techniques, bioinformaticians are faced with large amounts of genomic and clinical data, which are growing exponentially. A striking example is The Cancer Genome Atlas (TCGA), whose aim is to provide a comprehensive archive of biomedical data about tumors. Indeed, TCGA contains more than 15 TB of genomic and clinical data, whose analysis and interpretation are posing great challenges to the bioinformatics community. In this work, we focus on integration and analysis of NGS data extracted from TCGA. In particular, we integrate RNA-seq and DNA-methylation experiments and perform a supervised classification analysis. Thanks to this integration, we are able to distinguish successfully the tumoral samples from the normal ones and to extract reliable rule-based classification models that contain salient features (i.e., genes and methylated sites). These features, which are related to the investigated tumor, can be studied by domain experts in order to obtain new knowledge about cancer. Finally, our proposed integration and analysis method can be adopted with success for further studies on different data sources and NGS experiments.
机译:由于下一代测序(NGS)技术的巨大进步,生物信息学家面临着成倍增长的大量基因组和临床数据。一个明显的例子是《癌症基因组图谱》(TCGA),其目的是提供有关肿瘤的生物医学数据的全面档案。确实,TCGA包含超过15 TB的基因组和临床数据,其分析和解释对生物信息学界构成了巨大挑战。在这项工作中,我们专注于从TCGA提取的NGS数据的集成和分析。特别是,我们整合了RNA-seq和DNA-甲基化实验,并进行了监督分类分析。由于这种整合,我们能够成功地将肿瘤样品与正常样品区分开,并提取出包含显着特征(即基因和甲基化位点)的基于规则的可靠分类模型。这些与所研究的肿瘤有关的特征可以由领域专家进行研究,以获得有关癌症的新知识。最后,我们提出的整合和分析方法可以成功地用于不同数据源和NGS实验的进一步研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号