Method and apparatus for bilingual word association, method and apparatus for training bilingual word correspondence model

Abstract

The present invention provides a method and apparatus for bilingual word alignment, and a method and apparatus for training a bilingual word alignment model. The method for bilingual word alignment comprises: training a bilingual word alignment model using a word-aligned labeled bilingual corpus; word-aligning a plurality of bilingual sentence pairs in an unlabeled bilingual corpus using said bilingual word alignment model; determining whether the word alignment of each of said plurality of bilingual sentence pairs is correct and, if it is correct, adding the bilingual sentence pair to the labeled bilingual corpus and removing it from the unlabeled bilingual corpus; retraining the bilingual word alignment model using the expanded labeled bilingual corpus; and re-aligning the remaining bilingual sentence pairs in the unlabeled bilingual corpus using the retrained bilingual word alignment model.
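The steps in the abstract amount to a self-training loop, which can be sketched roughly as follows. This is only an illustrative toy, not the patented implementation: the abstract does not say what kind of alignment model is used or how an alignment is judged correct, so the count-based `train`, the process-of-elimination step in `align`, the acceptance test `looks_correct`, the `max_rounds` parameter, and the repetition of the loop over several rounds are all assumptions introduced here.

```python
from collections import Counter, defaultdict


def train(labeled):
    """Count word pairs linked in the labeled corpus (toy stand-in for a real model)."""
    counts = defaultdict(Counter)
    for src, tgt, links in labeled:
        for i, j in links:
            counts[src[i]][tgt[j]] += 1
    return counts


def align(model, src, tgt):
    """Link each known source word to its most frequent translation found in tgt."""
    links = []
    for i, s in enumerate(src):
        seen = model.get(s, Counter())
        candidates = [(seen[t], j) for j, t in enumerate(tgt) if seen[t] > 0]
        if candidates:
            links.append((i, max(candidates)[1]))
    # Assumed heuristic: a single leftover word on each side is linked together.
    free_src = set(range(len(src))) - {i for i, _ in links}
    free_tgt = set(range(len(tgt))) - {j for _, j in links}
    if len(free_src) == 1 and len(free_tgt) == 1:
        links.append((free_src.pop(), free_tgt.pop()))
    return sorted(links)


def looks_correct(src, tgt, links):
    """Assumed acceptance test: every word on both sides is linked exactly once."""
    return (sorted(i for i, _ in links) == list(range(len(src)))
            and sorted(j for _, j in links) == list(range(len(tgt))))


def self_train(labeled, unlabeled, max_rounds=10):
    """Train, align, move accepted pairs to the labeled set, retrain, re-align."""
    labeled, unlabeled = list(labeled), list(unlabeled)
    model = train(labeled)
    for _ in range(max_rounds):
        accepted, remaining = [], []
        for src, tgt in unlabeled:
            links = align(model, src, tgt)
            if looks_correct(src, tgt, links):
                accepted.append((src, tgt, links))
            else:
                remaining.append((src, tgt))
        if not accepted:
            break  # nothing new was accepted, so retraining would change nothing
        labeled += accepted      # expand the labeled corpus
        unlabeled = remaining    # remove accepted pairs from the unlabeled corpus
        model = train(labeled)   # retrain on the expanded corpus
    return model, labeled, unlabeled


# Hypothetical toy data: one hand-aligned pair and three unaligned pairs.
labeled = [(["the", "cat"], ["le", "chat"], [(0, 0), (1, 1)])]
unlabeled = [
    (["the", "dog"], ["le", "chien"]),
    (["dog", "sleeps"], ["chien", "dort"]),
    (["a", "bird", "sings"], ["un", "oiseau", "chante"]),
]
model, labeled_out, remaining = self_train(labeled, unlabeled)
```

In this toy run the second pair can only be aligned after the first pair has been accepted and the model retrained, which is the point of re-aligning the remaining pairs with the retrained model. The third pair shares no vocabulary with the labeled data and stays in the unlabeled corpus.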
