METHODS AND APPARATUSES FOR EMBEDDING WORD CONSIDERING CONTEXTUAL AND MORPHOSYNTACTIC INFORMATION

Abstract

The present invention relates to a word embedding method and apparatus that take into account the context information and morphological information of a word. A word embedding method according to an embodiment of the present invention comprises: processing a sentence to be learned by replacing each out-of-vocabulary (OOV) word in the sentence with an unknown token; inputting the characters of a target word, other than the OOV words, from the processed sentence into a context character model to be trained; combining the surrounding context vectors of the words surrounding the target word in the sentence and setting the result as the initial state of the context character model; and training the context character model to minimize the error between the predicted embedding of the target word, generated by concatenating a forward hidden state and a backward hidden state computed from the context character model, and the real embedding of the target word.
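The steps named in the abstract can be illustrated with a minimal, dependency-free sketch. It is not the patented implementation: the abstract does not specify the recurrent cell, how the context vectors are combined, how dimensions are reconciled, or the error function. The sketch assumes a plain tanh RNN, an average over a fixed window, a word embedding whose size is twice the hidden size (so the averaged context vector can be split to initialise the two directions), and squared error. All names (`replace_oov`, `context_vector`, `predict_embedding`, `demo`, the `<unk>` spelling) are hypothetical. Only the forward pass and the loss are shown; a real training loop would backpropagate this loss into the model parameters.

```python
import math
import random

UNK = "<unk>"  # hypothetical spelling of the unknown token


def replace_oov(tokens, vocab):
    """Replace each out-of-vocabulary token with the unknown token."""
    return [t if t in vocab else UNK for t in tokens]


def context_vector(tokens, idx, word_emb, window=2):
    """Average the embeddings of the words around idx (assumed combiner)."""
    lo, hi = max(0, idx - window), min(len(tokens), idx + window + 1)
    neighbours = [word_emb[tokens[j]] for j in range(lo, hi) if j != idx]
    if not neighbours:
        return [0.0] * len(word_emb[UNK])
    return [sum(col) / len(neighbours) for col in zip(*neighbours)]


def run_rnn(chars, h, char_emb, W, U):
    """Plain tanh RNN over a character sequence; returns the final state."""
    for c in chars:
        x = char_emb[c]
        h = [math.tanh(sum(w * v for w, v in zip(W[i], x)) +
                       sum(u * v for u, v in zip(U[i], h)))
             for i in range(len(h))]
    return h


def predict_embedding(word, ctx, p):
    """Concatenate forward and backward final states, both seeded from ctx."""
    half = len(ctx) // 2  # assumption: embedding size == 2 * hidden size
    fwd = run_rnn(word, ctx[:half], p["char_emb"], p["Wf"], p["Uf"])
    bwd = run_rnn(word[::-1], ctx[half:], p["char_emb"], p["Wb"], p["Ub"])
    return fwd + bwd


def squared_error(pred, real):
    """Assumed error function between predicted and real embeddings."""
    return sum((a - b) ** 2 for a, b in zip(pred, real))


def rand_matrix(rng, rows, cols):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def demo():
    rng = random.Random(0)
    emb_dim, hidden, char_dim = 4, 2, 3
    vocab = {"the", "cat", "sat", "on", "mat"}
    # Toy "real" embeddings; in practice these would be pretrained vectors.
    word_emb = {w: [rng.uniform(-1, 1) for _ in range(emb_dim)]
                for w in sorted(vocab | {UNK})}
    params = {
        "char_emb": {c: [rng.uniform(-1, 1) for _ in range(char_dim)]
                     for c in "abcdefghijklmnopqrstuvwxyz"},
        "Wf": rand_matrix(rng, hidden, char_dim),
        "Uf": rand_matrix(rng, hidden, hidden),
        "Wb": rand_matrix(rng, hidden, char_dim),
        "Ub": rand_matrix(rng, hidden, hidden),
    }
    tokens = replace_oov("the cat sat on the zzyzx mat".split(), vocab)
    losses = {}
    for i, word in enumerate(tokens):
        if word == UNK:  # OOV positions are never used as targets
            continue
        ctx = context_vector(tokens, i, word_emb)
        pred = predict_embedding(word, ctx, params)
        losses[(i, word)] = squared_error(pred, word_emb[word])
    return tokens, losses


if __name__ == "__main__":
    tokens, losses = demo()
    print(tokens)
    for key, value in losses.items():
        print(key, round(value, 4))
```

Because the model reads characters rather than whole-word identifiers, a trained version of it could produce an embedding for an unseen word from its spelling and its neighbours, which appears to be the purpose of combining morphological and contextual information.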

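Bibliographic details

  • Publication number: KR102227939B1

  • Publication date: 2021-03-15

  • Original format: PDF

  • Application number: KR1020190038587

  • Filing date: 2019-04-02

  • Classification: G06F40/20; G06N20

  • Country: KR

  • Date added to database: 2022-08-24 17:42:17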
