Journal: Mobile Information Systems

Replacing Out-of-Vocabulary Words with an Appropriate Synonym Based on Word2VnCR

Abstract

The most typical problem in natural language analysis is finding synonyms for out-of-vocabulary (OOV) words. When a person tries to understand a sentence containing an OOV word, they infer the most appropriate meaning of a replacement word from the meanings of the words that co-occur in the same context, drawing on the conceptual system they have learned. This study proposes a word-to-vector and conceptual relationship (Word2VnCR) algorithm that replaces an OOV word, which would otherwise lead to an erroneous morphemic analysis, with an appropriate synonym. The Word2VnCR algorithm improves on the conventional Word2Vec algorithm, which has difficulty suggesting a replacement word because it does not determine the similarity of the word. After word-embedding learning is performed on the training dataset, replacement word candidates for the OOV word are extracted. The semantic similarity between each extracted candidate and the words neighboring the OOV word is then measured, and the candidate with the highest similarity value is selected as the replacement. To evaluate the proposed Word2VnCR algorithm, a comparative experiment was conducted against the Word2Vec algorithm. The experimental results indicate that the proposed algorithm achieves higher accuracy than the Word2Vec algorithm.
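The selection step described in the abstract (score each replacement candidate against the words neighboring the OOV word, then keep the highest-scoring one) can be illustrated with a minimal Python sketch. This is not the authors' Word2VnCR implementation: the abstract does not specify the similarity measure, so cosine similarity over hand-made toy vectors is used here as a stand-in, and the names `cosine`, `pick_replacement`, and `TOY_VECTORS` are hypothetical.

```python
import math

# Toy 3-dimensional "embeddings" invented for illustration only.
# A real system would load vectors learned from a training corpus.
TOY_VECTORS = {
    "drink":  [0.90, 0.10, 0.00],
    "hot":    [0.70, 0.30, 0.10],
    "cup":    [0.80, 0.20, 0.10],
    "coffee": [0.85, 0.15, 0.05],
    "engine": [0.05, 0.10, 0.90],
}


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def pick_replacement(candidates, context_words, vectors):
    """Return (best_candidate, scores).

    Each candidate is scored by its mean cosine similarity to the
    context words that have a vector. Averaging is an assumption of
    this sketch, not something stated in the abstract.
    """
    known_context = [w for w in context_words if w in vectors]
    scores = {}
    for cand in candidates:
        if cand not in vectors:
            continue  # a candidate without a vector cannot be scored
        sims = [cosine(vectors[cand], vectors[w]) for w in known_context]
        scores[cand] = sum(sims) / len(sims) if sims else 0.0
    best = max(scores, key=scores.get) if scores else None
    return best, scores


# Hypothetical sentence: "drink a hot cup of <OOV>"
best, scores = pick_replacement(
    candidates=["coffee", "engine"],
    context_words=["drink", "hot", "cup"],
    vectors=TOY_VECTORS,
)
print(best, scores)
```

With these toy vectors, "coffee" lies close to all three context words while "engine" does not, so "coffee" is chosen. In practice the candidate list would come from the trained embedding model, and the scoring function would be replaced by whatever conceptual-relationship measure the paper defines.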
