Journal: Machine Translation

A Morphological Tagger for Korean: Statistical Tagging Combined with Corpus-Based Morphological Rule Application


Abstract

This paper describes a novel approach to morphological tagging for Korean, an agglutinative language with a very productive inflectional system. The tagger takes raw text as input and returns a lemmatized and morphologically disambiguated output for each word: the lemma is labeled with a part-of-speech (POS) tag and the inflections are labeled with inflectional tags. Unlike the standard approach to tagging for morphologically complex languages, in our proposed approach the tagging phase precedes the analysis phase. It comprises a trigram-based tagging component followed by a morphological rule application component, obtaining 95% precision and recall on unseen test data.
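The two-stage pipeline described in the abstract (trigram tagging first, morphological rule application second) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The word-level tag names, the three training sentences, the add-one smoothing, and the hand-written rule table are all assumptions made for illustration; the paper derives its rules from a corpus, and its actual tag set and model details are not given in the abstract.

```python
import math
from collections import Counter

START = "<s>"

# Toy training corpus of (word, word-level tag) pairs. Tag names are hypothetical.
TRAIN = [
    [("나는", "N+TOP"), ("밥을", "N+OBJ"), ("먹었다", "V+PAST+DECL")],
    [("나는", "N+TOP"), ("책을", "N+OBJ"), ("읽었다", "V+PAST+DECL")],
    [("나는", "N+TOP"), ("학교에", "N+LOC"), ("있었다", "V+PAST+DECL")],
]

# Hypothetical hand-written rules (the paper extracts its rules from a corpus):
# (word tag, surface suffix) -> (stem POS, [(morpheme, inflectional tag), ...])
RULES = {
    ("N+TOP", "는"): ("N", [("는", "TOP")]),
    ("N+OBJ", "을"): ("N", [("을", "OBJ")]),
    ("N+LOC", "에"): ("N", [("에", "LOC")]),
    ("V+PAST+DECL", "었다"): ("V", [("었", "PAST"), ("다", "DECL")]),
}


def train(sentences):
    """Count tag trigrams, their two-tag contexts, and word emissions."""
    tri, bi, emit, tag_count = Counter(), Counter(), Counter(), Counter()
    vocab = set()
    for sent in sentences:
        tags = [START, START] + [t for _, t in sent]
        for i in range(2, len(tags)):
            tri[tuple(tags[i - 2:i + 1])] += 1
            bi[tuple(tags[i - 2:i])] += 1
        for word, tag in sent:
            emit[(tag, word)] += 1
            tag_count[tag] += 1
            vocab.add(word)
    return tri, bi, emit, tag_count, vocab


def viterbi(words, model):
    """Return the best tag sequence under a trigram HMM with add-one smoothing."""
    tri, bi, emit, tag_count, vocab = model
    tags = list(tag_count)
    n_tags, n_words = len(tags), len(vocab) + 1  # +1 reserves mass for unknown words

    def log_trans(a, b, c):
        return math.log((tri[(a, b, c)] + 1) / (bi[(a, b)] + n_tags))

    def log_emit(tag, word):
        return math.log((emit[(tag, word)] + 1) / (tag_count[tag] + n_words))

    # State = (previous tag, current tag); value = (log score, tag path so far).
    best = {(START, START): (0.0, [])}
    for word in words:
        nxt = {}
        for (a, b), (score, path) in best.items():
            for c in tags:
                s = score + log_trans(a, b, c) + log_emit(c, word)
                if (b, c) not in nxt or s > nxt[(b, c)][0]:
                    nxt[(b, c)] = (s, path + [c])
        best = nxt
    return max(best.values(), key=lambda v: v[0])[1]


def apply_rules(word, tag):
    """Split a tagged word into (stem, POS) plus a list of tagged inflections."""
    for (rule_tag, suffix), (pos, inflections) in RULES.items():
        if rule_tag == tag and word.endswith(suffix):
            return (word[:-len(suffix)], pos), inflections
    return (word, tag), []  # no matching rule: leave the word unanalyzed


def tag_and_analyze(words, model):
    """Stage 1: tag whole words. Stage 2: apply rules selected by the chosen tag."""
    return [apply_rules(w, t) for w, t in zip(words, viterbi(words, model))]
```

Calling `tag_and_analyze(["나는", "책을", "먹었다"], train(TRAIN))` first tags each whole word and only then splits it into a stem and its inflections. The sketch reflects the ordering the abstract emphasizes: because the tag is chosen before analysis, the rule step only has to consider rules compatible with that tag rather than enumerating every possible segmentation of the word.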
