首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Open Vocabulary Learning for Neural Chinese Pinyin IME
【24h】

Open Vocabulary Learning for Neural Chinese Pinyin IME

机译:神经汉语拼音输入法的开放式词汇学习

获取原文

摘要

Pinyin-to-character (P2C) conversion is the core component of pinyin-based Chinese input method engine (IME). However, the conversion is seriously compromised by the ambiguities of Chinese characters corresponding to pinyin as well as the predefined fixed vocabularies. To alleviate such inconveniences, we propose a neural P2C conversion model augmented by an online updated vocabulary with a sampling mechanism to support open vocabulary learning during IME working. Our experiments show that the proposed method outperforms commercial IMEs and state-of-the-art traditional models on standard corpus and true inputting history dataset in terms of multiple metrics and thus the online updated vocabulary indeed helps our IME effectively follows user inputting behavior.
机译:拼音到字符(P2C)转换是基于拼音的中文输入法引擎(IME)的核心组件。但是,由于与拼音相对应的汉字的歧义性以及预定义的固定词汇表严重地影响了转换。为了减轻此类不便,我们提出了一种神经P2C转换模型,该模型由在线更新的词汇表扩展,并带有采样机制以支持IME工作期间的开放式词汇表学习。我们的实验表明,所提出的方法在多种度量方面优于标准语料库和真实输入历史数据集上的商业IME和最先进的传统模型,因此在线更新的词汇表确实帮助我们的IME有效地遵循了用户输入行为。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号