...
首页> 外文期刊>IEICE Transactions on Information and Systems >Joint Chinese Word Segmentation and POS Tagging Using an Error-Driven Word-Character Hybrid Model
【24h】

Joint Chinese Word Segmentation and POS Tagging Using an Error-Driven Word-Character Hybrid Model

机译:使用错误驱动的字-字符混合模型的联合中文分词和POS标记

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we present a discriminative word-character hybrid model for joint Chinese word segmentation and POS tagging. Our word-character hybrid model offers high performance since it can handle both known and unknown words. We describe our strategies that yield good balance for learning the charncteristics of known and unknown words and propose an error-driver peliey that delivers such balance by acquiring examples of unknown wo ds from particular errors in a training corpus. We describe an efficient fiamework for training our model based on the Margin Infused Relaxed Algorilhm (MIRA). evaluate our approach on the Penn Chinese Treebank. and show that it achieves superior performance compared to the state-of-the-art approaches reported in the literature.
机译:在本文中,我们提出了用于联合中文分词和POS标记的判别词-字符混合模型。我们的单词-字符混合模型提供了高性能,因为它可以处理已知和未知的单词。我们描述了在学习已知词和未知词的特性方面产生良好平衡的策略,并提出了一种错误驱动程序原理,该方法通过从训练语料库中的特定错误中获取未知示例来实现这种平衡。我们描述了一种有效的火焰模型,用于基于Margin Infused Relaxed Algorilhm(MIRA)训练我们的模型。在Penn Chinese Treebank上评估我们的方法。并表明,与文献中报道的最新方法相比,它具有更高的性能。

著录项

  • 来源
    《IEICE Transactions on Information and Systems》 |2009年第12期|2298-2305|共8页
  • 作者单位

    Graduate School of Engineering, Kobe University, Kobe-shi, 657-8501 Japan National Institute of information and Communications Technology, Kyoto-fu, 619-0289 Japan;

    Graduate School of Engineering, Kobe University, Kobe-shi, 657-8501 Japan;

    Graduate School of Engineering, Kobe University, Kobe-shi, 657-8501 Japan;

    Graduate School of Engineering, Kobe University, Kobe-shi, 657-8501 Japan;

    Graduate School of Engineering, Kobe University, Kobe-shi, 657-8501 Japan;

    Graduate School of Engineering, Kobe University, Kobe-shi, 657-8501 Japan National Institute of information and Communications Technology, Kyoto-fu, 619-0289 Japan;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    word segmentation; POS tagging; error-driven; word-character hybrid model;

    机译:分词POS标签;错误驱动字-字符混合模型;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号