首页> 外文期刊>ACM transactions on Asian language information processing >A Dependency Parser for Spontaneous Chinese Spoken Language
【24h】

A Dependency Parser for Spontaneous Chinese Spoken Language

机译:自发汉语口语的依存解析器

获取原文
获取原文并翻译 | 示例
       

摘要

Dependency analysis is vital for spoken language understanding in spoken dialogue systems. However, existing research has mainly focused on western spoken languages, Japanese, and so on. Little research has been done for spoken Chinese in terms of dependency parsing. Therefore, the new spoken corpus, D-ESCSC (Dependency-Expressive Speech Corpus of Standard Chinese) is built by adding new dependency relations special to spoken Chinese based on a written Chinese annotation scheme. Since spoken Chinese contains typical ill-grammatical phenomena, e.g., translocation, repetition, duplication, and omission, the new atom feature related to punctuation and three feature templates are proposed to improve a graph-based dependency parser. Experimental results on spoken Chinese corpus show that the atom feature and three templates really work and the new parser outperforms the baseline parser. To our best knowledge, it is the first work to report dependency parsing results of spoken Chinese.
机译:依赖性分析对于口语对话系统中的口语理解至关重要。但是,现有的研究主要集中于西方口语,日语等。在依赖分析方面,对口语的研究很少。因此,基于书面中文注释方案,通过向语音中文添加特殊的新的依赖关系,来构建新的语音语料库D-ESCSC(标准中文的依赖关系表现语音语料库)。由于口语包含典型的不良语法现象,例如易位,重复,重复和省略,因此提出了与标点符号相关的新原子特征和三个特征模板,以改进基于图的依存解析器。对中文语料库的实验结果表明,原子特征和三个模板确实有效,并且新的解析器的性能优于基线解析器。据我们所知,这是第一个报告汉语口语解析的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号