Dataset Mention Extraction and Classification

机译：数据集提及提取和分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Datasets are integral artifacts of empirical scientific research. However, due to natural language variation, their recognition can be difficult and even when identified, can often be inconsistently referred across and within publications. We report our approach to the Coleridge Initiative's Rich Context Competition, which tasks participants with identifying dataset surface forms (dataset mention extraction) and associating the extracted mention to its referred dataset (dataset classification). In this work, we propose various neural baselines and evaluate these model on one-plus and zero-shot classification scenarios. We further explore various joint learning approaches - exploring the synergy between the tasks - and report the issues with such techniques.

机译：数据集是经验科学研究不可或缺的产物。但是，由于自然语言的变化，它们的识别可能会很困难，甚至在被识别时也常常会在出版物中和出版物中前后不一致地被提及。我们向Coleridge Initiative的Rich Context Competition报告了我们的方法，该竞赛要求参与者识别数据集表面形式（数据集提及提取），并将提取的提及关联到其引用的数据集（数据集分类）。在这项工作中，我们提出了各种神经基线，并在一加和零击分类方案中评估了这些模型。我们进一步探索各种联合学习方法-探索任务之间的协同作用-并报告此类技术的问题。

著录项

来源
《Annual conference of the North American Chapter of the Association for Computational Linguistics: human language technologies;Workshop on extraction of structured knowledge from scientific publications》|2019年|31-36|共6页
会议地点 Minneapolis(US)
作者
Animesh Prasad; Chenglei Si; Min-Yen Kan;
展开▼
作者单位

School of Computing National University of Singapore;

River Valley High School Singapore;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Tensor decompositions for feature extraction and classification of high dimensional datasets [J] . Anh Huy Phan, Andrzej Cichocki Nonlinear Theory and Its Applications . 2010,第1期

机译：用于高维数据集特征提取和分类的张量分解
2. Softcite dataset: A dataset of software mentions in biomedical and economic research publications [J] . Caifan Du, Johanna Cohoon, Patrice Lopez, Journal of the Association for Information Science and Technology . 2021,第7期

机译：Softcite DataSet：生物医学和经济研究出版物中的软件提到数据集
3. A Bi-LSTM mention hypergraph model with encoding schema for mention extraction [J] . Lin Jerry Chun-Wei, Shao Yinan, Zhou Yujie, Engineering Applications of Artificial Intelligence . 2019,第Octa期

机译：Bi-LSTM提及超图模型，具有用于提及提取的编码方案
4. Dataset Mention Extraction and Classification [C] . Animesh Prasad, Chenglei Si, Min-Yen Kan Annual conference of the North American Chapter of the Association for Computational Linguistics: human language technologies . 2019

机译：数据集提取提取和分类
5. Iterative Cell Extraction and Registration for Analysis of Time-Lapse Neural Calcium Imaging Datasets [D] . ?Tasci, Tugce 2020

机译：迭代细胞提取和注册分析延时神经钙成像数据集
6. Automated Extraction and Classification of Cancer Stage Mentions fromUnstructured Text Fields in a Central Cancer Registry [O] . Abdulrahman K. AAlAbdulsalam, Jennifer H. Garvin, Andrew Redd, 2018

机译：从中央癌症登记处非结构化文本字段中自动提取和分类癌症分期说明
7. Joint Mention Extraction and Classification with Mention Hypergraphs [O] . Wei Lu, Dan Roth 2015

机译：提及超图的联合提及提取与分类
8. T-Cube: A Data Structure for Fast Extraction of Time Series from Large Datasets [R] . Sabhnani, M. , Moore, A. W. , Dubrawski, A. W. 2007

机译：T-Cube：一种从大型数据集中快速提取时间序列的数据结构

Dataset Mention Extraction and Classification

摘要

著录项

相似文献

相关主题

期刊订阅