首页> 外文会议>International Symposium on Natural Language Processing >Thai Named Entity Recognition Based on Conditional Random Fields
【24h】

Thai Named Entity Recognition Based on Conditional Random Fields

机译:基于条件随机字段的泰国命名实体识别

获取原文

摘要

This paper presents the Thai named entity recognition (NER) systems using Conditional Random Fields (CRFs). In the previous studies of Thai NER, there are not any systems using syllable-segmented data as an input but word-segmented one. Since the results of some researches on NER in other languages such as Chinese show that the systems based on character are better than those based on word, this study is also conducted to find out if the syllable-segmented input helps improve Thai NER. In order to compare the system getting word-segmented input to that getting syllable-segmented input, there will be two sets of features used in the systems in this study. The results of the experiment show that the systems do not perform well enough due to few features used. However, it reveals that the syllable-based system is slightly better than the word-based one. The corpus, training data preparation and system overview are also included in this paper.
机译:本文介绍了使用条件随机字段(CRF)的命名实体识别(NER)系统。在泰语前面的研究中,没有任何系统使用音节分段数据作为输入而是单词分段。由于诸如中文这样的其他语言的NER研究结果表明,基于角色的系统比基于Word的系统更好,还进行了该研究,以了解音节分段的输入是否有助于改进泰语。为了将系统与获取音节分段输入的字段分段输入进行比较,在本研究中的系统中将存在两组特征。实验结果表明,由于使用的功能很少,系统不会足够好。但是,它揭示了基于音节的系统比基于词的系统略好。本文还包括语料库,培训数据准备和系统概述。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号