首页> 外文期刊>International Journal of Computer Processing of Oriental Languages >Chunk Segmentation of Chinese Sentences Using a Combined Statistical and Rule-based Approach (CSRA)
【24h】

Chunk Segmentation of Chinese Sentences Using a Combined Statistical and Rule-based Approach (CSRA)

机译:统计和规则相结合的方法(CSRA)对汉语句子进行大块分割

获取原文
获取原文并翻译 | 示例
       

摘要

Deep parsing of Chinese sentences is a very challenging task due to their complexity such as ambiguous word boundaries and meanings. An alternative mode of Chinese language processing is to perform shallow parsing of Chinese sentences in which chunk segmentation plays an important role. In this paper, we present a chunk segmentation algorithm using a combined statistical and rule-based approach (CSRA). The decision rules for refining chunk segmentation are generated from incorrectly segmented chunks from a statistical model which is built on a training corpus. Experimental results show that the CSRA works well and produces satisfactory chunk segmentation results for subsequent processes such as chunk tagging and chunk collocation extraction.
机译:由于句子的边界和含义含糊不清,因此对中文句子进行深度解析是一项非常具有挑战性的任务。中文处理的另一种模式是对中文句子进行浅层分析,其中块分割起着重要的作用。在本文中,我们提出了一种使用统计和基于规则的组合方法(CSRA)的块分割算法。用于细化块分割的决策规则是从基于训练语料库的统计模型中未正确分割的块生成的。实验结果表明,CSRA可以很好地工作,并为随后的过程(例如,块标记和块搭配提取)产生令人满意的块分割结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号