Coarse-to-fine multi-task training of convolutional neural networks for automated information extraction from cancer pathology reports

机译：关于癌症病理报告的自动信息提取的卷积神经网络粗致精细的多任务培训

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Information extraction and coding of free-text pathology reports is an important activity for cancer registries to support national cancer surveillance. Cancer registrars must process high volumes of pathology reports on an annual basis. In this study, we investigated an automated approach using a coarse-to-fine training of convolutional neural networks (CNNs) for extracting the primary site, histological grade and laterality from unstructured cancer pathology text reports. Our proposed training scheme consists of two stages. In the first stage, the multi-task learning (MTL) with hard parameter sharing approach is used to train a multi-task MT-CNN model for all the tasks. Then, the TM-CNN model parameters are used to initialize a CNN model for each task to be fine trained individually using its corresponding dataset. The performance of our proposed approach was compared against a state-of-the-art CNN and the commonly used SVM classifier. We observed that the proposed model consistently outperformed the base line models, especially for the less prevalent classes. Specifically, the proposed training approach achieved a micro-F score of 0.7749 over 12 ICD-O-3 topography codes which is a significant improvement as compared with state-of-the-art CNN (0.7101) and the SVM (0.6019) classifiers. Also, the results demonstrate the potential of the proposed method for handling class imbalance within each task. It significantly improves macro-F score by 24% and 12% of the primary site and histology grade tasks, respectively.

机译：自由文本病理报告的信息提取和编码是癌症注册管理机构支持国家癌症监测的重要活动。癌症注册商必须每年处理大量的病理报告。在这项研究中，我们研究了一种使用对卷积神经网络（CNNS）的粗细训练来提取来自非结构化癌症病理学文本报告的主要部位，组织学等分和横向性的自动化方法。我们拟议的培训计划包括两个阶段。在第一阶段，具有硬参数共享方法的多任务学习（MTL）用于为所有任务培训多项任务MT-CNN模型。然后，TM-CNN模型参数用于初始化每个任务的CNN模型，以使用其相应的数据集单独训练。将我们提出的方法的性能与最先进的CNN和常用的SVM分类器进行比较。我们观察到所提出的模型始终如一地优于基线模型，特别是对于较少的普遍等级。具体地，所提出的培训方法实现了超过12个ICD-O-3形貌码的Micro-F得分为0.7749分，这是与最先进的CNN（0.7101）和SVM（0.6019）分类器相比的显着改善。此外，结果表明了在每个任务中处理类别不平衡的提议方法的潜力。它显着提高了宏-F分别通过24 ％和12 ％的主站点和组织学等级任务来提高宏。

著录项

来源
《IEEE EMBS International Conference on Biomedical and Health Informatics》|2018年|445p|共4页
会议地点
作者
Mohammed Alawad; Hong-Jun Yoon; Georgia D. Tourassi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 R318-53;
关键词
Task analysis; Cancer; Pathology; Training; Information retrieval; Breast; Support vector machines;

机译：任务分析;癌症;病理学;培训;信息检索;乳房;支持矢量机器;

相似文献

外文文献
中文文献
专利

1. Multi-Task Pre-Training of Deep Neural Networks for Digital Pathology [J] . Mormont Romain, Geurts Pierre, Maree Raphael Biomedical and Health Informatics, IEEE Journal of . 2021,第2期

机译：用于数字病理学深神经网络的多任务预培训
2. Deep convolutional neural networks for automated OCT pathology recognition [J] . Russakoff Daniel B., Oakley Jonathan D., Chang Robert Investigative ophthalmology & visual science . 2017,第8期

机译：用于自动化OCT病理识别的深度卷积神经网络
3. Deep convolutional neural networks for automated OCT pathology recognition [J] . Russakoff Daniel B., Oakley Jonathan D., Chang Robert Investigative ophthalmology & visual science . 2017,第8期

机译：用于自动化OCT病理识别的深度卷积神经网络
4. Coarse-to-fine multi-task training of convolutional neural networks for automated information extraction from cancer pathology reports [C] . Mohammed Alawad, Hong-Jun Yoon, Georgia D. Tourassi IEEE EMBS International Conference on Biomedical and Health Informatics . 2018

机译：卷积神经网络的从粗到细的多任务训练，可从癌症病理报告中自动提取信息
5. Cell segmentation in cancer histopathology images using convolutional neural networks. [D] . Kavassery Rajalingam, Viswanathan. 2016

机译：使用卷积神经网络在癌症组织病理学图像中进行细胞分割。
6. Automatic extraction of cancer registry reportable information from free-text pathology reports using multitask convolutional neural networks [O] . Mohammed Alawad, Shang Gao, John X Qiu, 2020

机译：使用Multitask卷积神经网络自动提取癌症注册表的癌症注册表可报告信息
7. Sentiment analysis using convolutional neural networks with multi-task training and distant supervision on italian tweets [O] . Deriu Jan Milan, Cieliebak Mark 2016

机译：使用卷积神经网络进行多任务训练并在意大利推文上进行远程监督的情感分析

Coarse-to-fine multi-task training of convolutional neural networks for automated information extraction from cancer pathology reports

摘要

著录项

相似文献

相关主题

期刊订阅