Removing the Training Wheels: A Coreference Dataset that Entertains Humans and Challenges Computers

机译：卸下训练轮：娱乐人类并挑战计算机的共参考数据集

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Coreference is a core NLP problem. However, newswire data, the primary source of existing coreference data, lack the richness necessary to truly solve coreference. We present a new domain with denser references-quiz bowl questions-that is challenging and enjoyable to humans, and we use the quiz bowl community to develop a new coreference dataset, together with an annotation framework that can tag any text data with coreferences and named entities. We also successfully integrate active learning into this annotation pipeline to collect documents maximally useful to coreference models. State-of-the-art coreference systems underperform a simple classifier on our new dataset, motivating non-newswire data for future coreference research.

机译：共指是NLP的核心问题。但是，新闻电报数据是现有共同引用数据的主要来源，缺乏真正解决共同引用所需的丰富性。我们提出了一个新的领域，其中包含了更密集的引用（测验碗问题），这对人类来说是具有挑战性和令人愉悦的，并且我们使用测验碗社区来开发新的共同引用数据集，以及可以使用共同引用标记任何文本数据并命名的注释框架实体。我们还将成功的学习成功地集成到该注释管道中，以收集对共参考模型最大有用的文档。先进的共同参照系统在我们的新数据集上的表现不及简单分类器，从而激发了非新闻界数据的未来共同参照研究的动力。

著录项

来源
《Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies》|2015年|1108-1118|共11页
会议地点
作者
Anupam Guha; Mohit Iyyer; Danny Bouman; Jordan Boyd-Graber;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. GenCoF: a graphical user interface to rapidly remove human genome contaminants from metagenomic datasets [J] . Czajkowski Matthew D., Vance Daniel P., Frese Steven A., Bioinformatics . 2019,第13期

机译：Gencof：一种快速移除偏见数据集的人类基因组污染物的图形用户界面
2. PARASITES OR BOOSTERS OF HUMAN CREATIVITY?: A MODEL TO SOLVE COPYRIGHT ISSUES OF TRAINING DATASETS OF ARTIFICIAL INTELLIGENCE BEFORE ALPHAART DEFEATS HUMAN ARTISTS [J] . Christina Han AIPLA quarterly journal . 2021,第2期

机译：寄生虫或人类创造力的助推器？：一个模型，用于解决alphaart击败人类艺术家之前训练人工智能的培训数据集的版权问题
3. Removing the training wheels: embracing the social, contextual and psychological in sports medicine [J] . Linda K Truong, Sheree Bekker, Jackie L Whittaker British journal of sports medicine . 2021,第9期

机译：去除训练轮：在体育医学中拥抱社会，情境和心理
4. Removing the Training Wheels: A Coreference Dataset that Entertains Humans and Challenges Computers [C] . Anupam Guha, Mohit Iyyer, Danny Bouman, Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . 2015

机译：删除训练轮：一个娱乐数据集，可娱乐人类和挑战计算机
5. We Need to Talk About Robustness to Adversarial Attacks while Removing Spurious Dataset Biases [D] . Sachdeva, Bhavdeep Singh. 2021

机译：我们需要在删除虚假数据集偏见时讨论对抗性攻击的鲁棒性
6. RDP5: a computer program for analyzing recombination in and removing signals of recombination from nucleotide sequence datasets [O] . Darren P Martin, Arvind Varsani, Philippe Roumagnac, 2021

机译：RDP5：用于分析重组的计算机程序并从核苷酸序列数据集中去除重组信号
7. Dataset S1 and Dataset S2: DatasetS1 contains the training data used in the “RESULTS section”. DatasetS2 contains the training data used in the “OUTLIER REMOVAL section” [O] . -1

机译：DataSet S1和DataSet S2：DataSets1包含“结果部分”中使用的培训数据。 DataSets2包含“异常删除部分”中使用的培训数据
8. NATOs Relevance to United States Enduring National Interests Time to Remove the Training Wheels but Continue to Hold the Handle Bars. [R] . Counihan, S. F. 2016

机译：北约与美国持久的国家利益相关时间去除训练轮但继续握住手柄杆。

Removing the Training Wheels: A Coreference Dataset that Entertains Humans and Challenges Computers

摘要

著录项

相似文献

相关主题

期刊订阅