首页> 外文会议>Systems and Information Engineering Design Symposium >Using autoencoders and text mining to characterize single cell populations in the hippocampus and cortex
【24h】

Using autoencoders and text mining to characterize single cell populations in the hippocampus and cortex

机译:使用AutoEncoders和Text Mining在海马和皮质中表征单个小区群体

获取原文

摘要

Revolutionary advances in genomic technology have allowed researchers to address biological questions about cell types, states, and gene regulation at the scale of single cells. However, the ability to characterize gene expression and function of individual cells brings with it new data-related challenges, such as dimensionality, feature reduction, and noise reduction. The central objective of this research was to use existing methods in a novel application of single-cell gene expression data to better characterize sub-populations of cell types in various regions of the brain. This research approach used a computational bioinformatics pipeline for single-cell RNA-sequencing (RNA-seq) normalization and clustering. Data science methodologies, such as autoencoding and text mining, were adapted to identify candidate gene sets that distinguish different types of cells in the central nervous system. Then, the functional themes of these gene sets were inferred using a combination of functional enrichment of gene ontology terms and topic modeling. Topic modeling revealed various functional themes among the clusters, in some cases reinforcing the results of biomarker analysis, and in other cases providing further insight into potential functional differences between clusters. For one cluster in the cortex, an immune theme emerged with stemmed-words as specific as “immun” and “antigen” appearing in the results. In the hippocampus, clusters determined to be neurons could be further differentiated as themes related to various organs were identified. One of these clusters featured a vascular theme with words related to “endotheli.” Future applications of these methods intend to expound upon specific cellular processes, in relation to immune function, and translational research on neurological disease states.
机译:基因组技术的革命性进展使研究人员能够在单细胞范围内解决细胞类型,州和基因调节的生物学问题。然而,表征各个细胞基因表达和功能的能力带来了新的数据相关的挑战,例如维度,特征减少和降噪。该研究的中心目标是使用在单细胞基因表达数据的新应用中使用现有方法,以更好地表征大脑的各个区域中细胞类型的子群。该研究方法使用用于单细胞RNA测序(RNA-SEQ)归一化和聚类的计算生物信息化管线。数据科学方法,例如自身谱和文本挖掘,适于识别区分中枢神经系统中不同类型细胞的候选基因集。然后,使用基因本体论术语和主题建模的功能性富集的组合推断出这些基因集的功能主题。主题建模在群集中揭示了各种功能主题,在一些情况下,在一些情况下加强了生物标志物分析的结果,并且在其他情况下,进一步了解簇之间的潜在功能差异。对于皮质中的一个簇,呈现出特异性为“IMMUN”和“抗原”的免疫主题出现在结果中。在海马中,可以进一步将确定为神经元的簇作为鉴定与各种器官相关的主题。其中一个集群用与“内皮”有关的词语来精选血管主题。这些方法的未来应用打算阐述特定的细胞过程,关于免疫功能以及神经疾病状态的翻译研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号