Conference on Empirical Methods in Natural Language Processing

Distilling an Ensemble of Greedy Dependency Parsers into One MST Parser



Abstract

We introduce two first-order graph-based dependency parsers achieving a new state of the art. The first is a consensus parser built from an ensemble of independently trained greedy LSTM transition-based parsers with different random initializations. We cast this approach as minimum Bayes risk decoding (under the Hamming cost) and argue that weaker consensus within the ensemble is a useful signal of difficulty or ambiguity. The second parser is a "distillation" of the ensemble into a single model. We train the distillation parser using a structured hinge loss objective with a novel cost that incorporates ensemble uncertainty estimates for each possible attachment, thereby avoiding the intractable cross-entropy computations required by applying standard distillation objectives to problems with structured outputs. The first-order distillation parser matches or surpasses the state of the art on English, Chinese, and German.
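The abstract compresses two ideas that are easier to see concretely: under the Hamming cost (the number of wrongly attached tokens), minimum Bayes risk decoding over an ensemble amounts to summing the ensemble's votes for each candidate head attachment and then decoding the highest-scoring tree, and those same vote shares provide a per-attachment uncertainty signal that can serve as the cost in a structured hinge loss. The sketch below is a minimal illustration of that voting step, not the authors' implementation; the greedy base parsers, the MST decoder (e.g. Chu-Liu/Edmonds), and the paper's exact cost function are assumed or simplified.

```python
from collections import Counter
from typing import Callable, Dict, List, Sequence

# Each base parser maps a sentence to a list of predicted head indices:
# heads[m] is the head of token m, with 0 denoting the artificial root.
Parser = Callable[[Sequence[str]], List[int]]


def consensus_arc_scores(sentence: Sequence[str],
                         ensemble: Sequence[Parser]) -> List[Dict[int, float]]:
    """Vote share of each candidate arc (head h -> dependent m) across the ensemble.

    Under the Hamming cost, minimum Bayes risk decoding reduces to choosing the
    tree whose arcs have the largest total vote share, so these shares can be
    used directly as first-order arc scores for an MST decoder.
    """
    n = len(sentence)
    votes = [Counter() for _ in range(n)]          # votes[m][h] = parsers choosing head h for m
    for parse in (p(sentence) for p in ensemble):
        for m, h in enumerate(parse):
            votes[m][h] += 1
    k = len(ensemble)
    return [{h: c / k for h, c in votes[m].items()} for m in range(n)]


def attachment_cost(scores: List[Dict[int, float]], m: int, h: int) -> float:
    """A plausible per-arc cost for the distillation hinge loss: attachments the
    ensemble rarely chose are penalized more (1 minus vote share). This is an
    illustrative stand-in for the paper's uncertainty-based cost, not its exact form."""
    return 1.0 - scores[m].get(h, 0.0)
```

The resulting per-arc scores can be handed to any first-order maximum-spanning-tree decoder to obtain the consensus parse, and `attachment_cost` shows how low-consensus attachments would be penalized during cost-augmented training.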


