Conference on Empirical Methods in Natural Language Processing

Mind the Gap: Data Enrichment in Dependency Parsing of Elliptical Constructions

Abstract

In this paper, we focus on parsing rare and non-trivial constructions, in particular ellipsis. We report on several experiments in enrichment of training data for this specific construction, evaluated on five languages: Czech, English, Finnish, Russian and Slovak. These data enrichment methods draw upon self-training and tri-training, combined with a stratified sampling method mimicking the structural complexity of the original treebank. In addition, using these same methods, we also demonstrate small improvements over the CoNLL-17 parsing shared task winning system for four of the five languages, not only restricted to the elliptical constructions.
