Incorporating Figure Captions and Descriptive Text in MeSH Term Indexing

Abstract

The goal of text classification is to automatically assign categories to documents. Deep learning learns effective features directly from data instead of relying on human-designed features. In this paper, we focus specifically on biomedical document classification using a deep learning approach. We present a novel multichannel TextCNN model for MeSH term indexing. Beyond the usual use of abstract and title text for model training, we also consider figure and table captions, as well as the paragraphs associated with those figures and tables. We demonstrate that these additional text sources are important feature sources for our method. A new dataset consisting of these text segments, curated from 257,590 full-text articles together with the articles' MEDLINE/PubMed MeSH terms, is publicly available.
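
To make the multichannel idea concrete, the following is a minimal sketch of a forward pass in which each text source gets its own convolution filters and the pooled features are concatenated before a multi-label output layer. It is not the authors' architecture: the class name, channel names, filter widths, embedding size, and the per-label sigmoid output are all assumptions chosen for illustration, and the weights are random rather than trained.

```python
import math
import random


def conv1d_max_pool(embedded, filters):
    """Slide each filter over the token sequence, apply ReLU, keep the max."""
    pooled = []
    for weights, bias in filters:  # weights: `width` rows of `dim` values
        width = len(weights)
        best = 0.0  # ReLU floor; also the result when the text is too short
        for start in range(len(embedded) - width + 1):
            window = embedded[start:start + width]
            act = bias + sum(
                w * x
                for row_w, row_x in zip(weights, window)
                for w, x in zip(row_w, row_x)
            )
            best = max(best, act)
        pooled.append(best)
    return pooled


class MultiChannelTextCNN:
    """Hypothetical sketch: one filter bank per text source (channel)."""

    def __init__(self, vocab, channels, num_labels, dim=8, widths=(2, 3), seed=0):
        rng = random.Random(seed)

        def rand():
            return rng.uniform(-0.5, 0.5)

        self.channels = list(channels)
        self.unknown = [0.0] * dim  # embedding used for out-of-vocabulary tokens
        self.embed = {tok: [rand() for _ in range(dim)] for tok in vocab}
        # Each channel owns its filters, so captions are not forced to share
        # convolution weights with abstracts.
        self.filters = {
            ch: [([[rand() for _ in range(dim)] for _ in range(w)], 0.0)
                 for w in widths]
            for ch in self.channels
        }
        num_features = len(self.channels) * len(widths)
        self.out = [[rand() for _ in range(num_features)]
                    for _ in range(num_labels)]

    def predict(self, doc):
        """Return one independent probability per label (multi-label output)."""
        features = []
        for ch in self.channels:
            tokens = doc.get(ch, "").lower().split()
            embedded = [self.embed.get(t, self.unknown) for t in tokens]
            features.extend(conv1d_max_pool(embedded, self.filters[ch]))
        scores = []
        for row in self.out:
            z = sum(w * f for w, f in zip(row, features))
            scores.append(1.0 / (1.0 + math.exp(-z)))  # sigmoid per label
        return scores


# Usage with made-up text; a missing channel simply contributes zero features.
model = MultiChannelTextCNN(
    vocab=["insulin", "glucose", "figure", "shows", "levels"],
    channels=["abstract_title", "captions", "paragraphs"],
    num_labels=4,
)
doc = {
    "abstract_title": "insulin glucose levels",
    "captions": "figure shows glucose levels",
}
print(model.predict(doc))
```

A sigmoid per label is used here because MeSH indexing assigns several terms to one article, so the labels are not mutually exclusive; a trained model would learn the embeddings, filters, and output weights rather than sampling them.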

Bibliographic record

Source
SIGBioMed workshop on biomedical natural language processing; Annual meeting of the Association for Computational Linguistics | 2019 | pp. 165-175 | 11 pages
Conference location: Florence (IT)
Authors
Xindi Wang; Robert E. Mercer
Author affiliation

Department of Computer Science, The University of Western Ontario, London, Ontario, Canada

Original format: PDF

