Making the most of limited training data using distant supervision


Abstract

Automatic recognition of relationships between key entities in text is an important problem which has many applications. Supervised machine learning techniques have proved to be the most effective approach to this problem. However, they require labelled training data which may not be available in sufficient quantity (or at all) and is expensive to produce. This paper proposes a technique that can be applied when only limited training data is available. The approach uses a form of distant supervision but does not require an external knowledge base. Instead, it uses information from the training set to acquire new labelled data and combines it with manually labelled data. The approach was tested on an adverse drug data set using a limited amount of manually labelled training data and shown to outperform a supervised approach.
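The abstract does not spell out how new labelled data is acquired from the training set, so the following is only a minimal sketch of one plausible reading, not the authors' method. It assumes that entity pairs marked as related in the manually labelled set can serve as seeds, and that any unlabelled sentence mentioning both entities of a seed pair is automatically labelled positive. The names `Example`, `seed_pairs`, and `distant_label`, and the toy sentences, are hypothetical.

```python
from typing import NamedTuple


class Example(NamedTuple):
    """One relation instance (hypothetical structure for illustration)."""
    sentence: str
    drug: str
    effect: str
    label: bool  # True if the drug is stated to cause the effect


def seed_pairs(labelled):
    """Collect (drug, effect) pairs marked positive in the manual data."""
    return {(ex.drug.lower(), ex.effect.lower()) for ex in labelled if ex.label}


def distant_label(unlabelled_sentences, pairs):
    """Assumption: a sentence mentioning both entities of a seed pair is
    treated as a positive instance. This heuristic is noisy by design."""
    new_examples = []
    for sentence in unlabelled_sentences:
        lowered = sentence.lower()
        for drug, effect in sorted(pairs):
            if drug in lowered and effect in lowered:
                new_examples.append(Example(sentence, drug, effect, True))
    return new_examples


# Toy data, invented for this sketch only.
labelled = [
    Example("Aspirin caused nausea in two patients.", "aspirin", "nausea", True),
    Example("Ibuprofen was given for headache.", "ibuprofen", "headache", False),
]
unlabelled = [
    "Nausea was reported after aspirin intake.",
    "Ibuprofen relieved the headache quickly.",
]

pairs = seed_pairs(labelled)
auto = distant_label(unlabelled, pairs)

# Combine manually labelled and automatically labelled data for training.
training_set = labelled + auto
```

Here the seeds come from the labelled data itself rather than from an external knowledge base, which is the property the abstract emphasises. How the paper filters noisy matches or weights the two data sources is not stated in this record.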


Bibliographic details

Source
Workshop on Biomedical Natural Language Processing 2015 | 2015 | pp. 12-20 | 9 pages
Conference location: Beijing, China
Authors
Roland Roller; Mark Stevenson
Author affiliations

Department of Computer Science, University of Sheffield, Regent Court, 211 Portobello, Sheffield S1 4DP, England (both authors)
Original format: PDF
Language of text: English

Similar literature

1. On the Construction of Web NER Model Training Tool based on Distant Supervision [J]. Chou Chien-Lung, Chang Chia-Hui, Lin Yuan-Hao. ACM Transactions on Asian Language Information Processing. 2020, No. 6
2. Distant supervision of relation extraction in sparse data [J]. Ranjbar-Sahraei Bijan, Rahmani Hossein, Weiss Gerhard. Intelligent Data Analysis. 2019, No. 5
3. Predicting Twitter User Demographics using Distant Supervision from Website Traffic Data [J]. Culotta Aron, Cutler Jennifer, Ravi Nirmal Kumar. The Journal of Artificial Intelligence Research. 2016, No. 12
4. Making the most of limited training data using distant supervision [C]. Roland Roller, Mark Stevenson. Workshop on Biomedical Natural Language Processing. 2015
5. Learning with Limited Labeled Data in Biomedical Domain by Disentanglement and Semi-Supervised Learning [D]. Gyawali, Prashnna Kumar. 2021
6. A hybrid approach toward biomedical relation extraction training corpora: combining distant supervision with crowdsourcing [O]. Diana Sousa, Andre Lamurias, Francisco M Couto. 2020
7. Making the most of limited training data using distant supervision [O]. Roland Roller, Mark Stevenson. 2015
