Er... well, it matters, right? On the role of data representations in spoken language dependency parsing

机译：呃......好吧，它很重要，对吧？关于数据表示在语言依赖解析中的作用

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Despite the significant improvement of data-driven dependency parsing systems in recent years, they still achieve a considerably lower performance in parsing spoken language data in comparison to written data. On the example of Spoken Slovenian Treebank, the first spoken data treebank using the UD annotation scheme, we investigate which speech-specific phenomena undermine parsing performance, through a series of training data and treebank modification experiments using two distinct state-of-the-art parsing systems. Our results show that utterance segmentation is the most prominent cause of low parsing performance, both in parsing raw and pre-segmented transcriptions. In addition to shorter utterances, both parsers perform better on normalized transcriptions including basic markers of prosody and excluding disfiuencies, discourse markers and fillers. On the other hand, the effects of written training data addition and speech-specific dependency representations largely depend on the parsing system selected.

机译：尽管近年来数据驱动依赖解析系统的重大改进，但与书面数据相比，它们仍然在解析口语数据方面仍然达到了相当低的性能。在口头斯洛文尼亚树班库的例子上，使用UD注释方案的第一个口头数据树银行，我们调查哪些语音特定现象破坏解析性能，通过使用两个不同的最先进的培训数据和树木银行改装实验进行解析性能解析系统。我们的研究结果表明，话语分割是解析原始和预分段的转录中解析性能的最突出原因。除了较短的话语之外，两个解析剂在规范化的转录上表现出更好的，包括韵律的基本标记，并不包括无差异，话语标记和填料。另一方面，书面训练数据添加和语音特定依赖关系的影响主要取决于所选解析系统。

著录项

来源
《Conference on empirical methods in natural language processing》|2018年|xiv 201 p.|共10页
会议地点
作者
Kaja Dobrovoljc; Matej Martinc;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. A Dependency Parser for Spontaneous Chinese Spoken Language [J] . He Ruifang, Wang Yaru, Song Dawei, ACM transactions on Asian language information processing . 2018,第4期

机译：自发汉语口语的依存解析器
2. Robust Dependency Parsing of Spontaneous Japanese Spoken Language [J] . Tomohiro OHNO, Shigeki MATSUBARA, Nobuo KAWAGUCHI, IEICE Transactions on Information and Systems . 2005,第3期

机译：自发日语口语的鲁棒依赖解析
3. MaltParser: A language-independent system for data-driven dependency parsing [J] . JOAKIM NIVRE, JOHAN HALL, JENS NILSSON, Natural language engineering . 2007,第Pt2期

机译：MaltParser：一种独立于语言的系统，用于数据驱动的依赖项解析
4. Er... well, it matters, right? On the role of data representations in spoken language dependency parsing [C] . Kaja Dobrovoljc, Matej Martinc Second workshop on universal dependencies . 2018

机译：嗯...很好，对吧？数据表示在口语依赖解析中的作用
5. Leveraging Training Data from High-Resource Languages to Improve Dependency Parsing for Low-Resource Languages [D] . Jaja, Claire. 2014

机译：利用来自高资源语言的培训数据来改善对低资源语言的依赖关系解析
6. Benchmarking natural-language parsers for biological applications using dependency graphs [O] . Andrew B Clegg, Adrian J Shepherd 2007

机译：使用依赖关系图对自然语言解析器进行生物应用基准测试
7. Er ... well, it matters, right? On the role of data representations in spoken language dependency parsing [O] . Kaja Dobrovoljc, Matej Martinc 2018

机译：呃......好吧，它很重要，对吧？关于数据表示在语言依赖解析中的作用

Er... well, it matters, right? On the role of data representations in spoken language dependency parsing

摘要

著录项

相似文献

相关主题

期刊订阅