首页> 外文会议>Workshop on Chinese Lexical Semantics >Automatic Recognition of Chinese Separable Words Based on CRFs
【24h】

Automatic Recognition of Chinese Separable Words Based on CRFs

机译:基于CRF的中国可分离单词自动识别

获取原文

摘要

Currently, most of the automatic recognition tasks of separable words adopt a rule-based method, which relies on automatic word segmentation results and lexical patterns generated from common inserted constituents. However, they suffer from incorrect word segmentation results and inaccurate and limited rules. Moreover, they ignore the rich information contained in the context. To address these issues, this paper proposes a CRFs-based method which employs nine features, such as character, POS tag, punctuation, word boundary, keyword and POS sequential rule. Experimental results on real-world datasets show that our approach can make full use of rich information and achieve significant improvements on recognition efficiency compared to all the baselines.
机译:目前,可分离单词的大多数自动识别任务采用基于规则的方法,它依赖于来自公共插入组件产生的自动词分段结果和词汇模式。但是,它们遭受了错误的单词分割结果和不准确和有限的规则。此外,它们忽略了上下文中包含的丰富信息。要解决这些问题,本文提出了一种基于CRF的方法,该方法采用了九个特征,例如字符,POS标记,标点符号,单词边界,关键字和POS顺序规则。实验结果对现实世界数据集表明,与所有基线相比,我们的方法可以充分利用丰富的信息并实现对识别效率的显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号