Conference on Empirical Methods in Natural Language Processing

The Hebrew Universal Dependency Treebank: Past, Present and Future



Abstract

The Hebrew treebank (HTB), consisting of 6221 morpho-syntactically annotated newspaper sentences, has been the only resource for training and validating statistical parsers and taggers for Hebrew for almost two decades. During these decades, the HTB has gone through a trajectory of automatic and semi-automatic conversions, until arriving at its UDv2 form. In this work we manually validate the UDv2 version of the HTB and, according to our findings, apply scheme changes that bring the UD HTB onto the same theoretical ground as the rest of UD. Our experimental parsing results with UDv2New confirm that improving the coherence and internal consistency of the UD HTB indeed leads to improved parsing performance. At the same time, our analysis demonstrates that there is more to be done at the point of intersection of UD with other linguistic processing layers, in particular at the points where UD interfaces with external morphological and lexical resources.
