International Joint Conference on Neural Networks

Not All Synonyms Are Created Equal: Incorporating Similarity of Synonyms to Enhance Word Embeddings



Abstract

Traditional word embedding approaches learn semantic information from the contexts of words in large unlabeled corpora. This ignores the fact that synonymous words often occur in different contexts within a corpus, so synonymy is not well captured in the resulting vectors. Furthermore, existing synonymy-based models incorporate synonyms directly when training word embeddings, but still neglect the degree of similarity between words and their synonyms. In this paper, we explore a novel approach that employs the similarity between words and their synonyms to train and enhance word embeddings. To this end, we build two Synonymy Similarity Models (SSMs), named SSM-W and SSM-M, which adopt different strategies to incorporate the similarity between words and their synonyms during training. We evaluate our models on both Chinese and English. The results demonstrate that our models outperform the baselines on seven word similarity datasets. On the analogical reasoning and text classification tasks, our models also surpass all baselines, including a synonymy-based model.
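
The abstract does not spell out the SSM-W or SSM-M training objectives, so the following is a minimal sketch only, assuming a skip-gram-style base objective with negative sampling. It illustrates the core idea that not all synonyms are created equal: each word-synonym pair contributes an attraction term scaled by the pair's current cosine similarity, so loosely related synonym pairs pull more weakly than strongly similar ones. The toy corpus, the SYNONYMS dictionary, and all hyperparameters are hypothetical, not taken from the paper.

```python
# Minimal sketch only: the SSM-W/SSM-M objectives are not given in the
# abstract. This combines plain skip-gram negative sampling with a
# synonym-attraction term weighted by the current word-synonym cosine
# similarity. Corpus, SYNONYMS, and hyperparameters are hypothetical.
import numpy as np

rng = np.random.default_rng(0)

corpus = "the quick fast fox jumps over the lazy idle dog".split()
SYNONYMS = {"quick": ["fast"], "lazy": ["idle"]}  # toy thesaurus

vocab = sorted(set(corpus))
idx = {w: i for i, w in enumerate(vocab)}
dim, lr, lam = 16, 0.05, 0.1  # embedding size, step size, synonym-term weight
W = rng.normal(scale=0.1, size=(len(vocab), dim))  # word vectors
C = rng.normal(scale=0.1, size=(len(vocab), dim))  # context vectors

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cosine(u, v):
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v) + 1e-8))

for epoch in range(200):
    for t, w in enumerate(corpus):
        wi = idx[w]
        # Skip-gram with one random negative sample per (word, context) pair.
        for c in corpus[max(0, t - 2):t] + corpus[t + 1:t + 3]:
            for tgt, label in ((idx[c], 1.0), (int(rng.integers(len(vocab))), 0.0)):
                g = sigmoid(W[wi] @ C[tgt]) - label
                grad_w = g * C[tgt]  # both gradients taken before either update
                grad_c = g * W[wi]
                W[wi] -= lr * grad_w
                C[tgt] -= lr * grad_c
        # Synonym term: pull the word toward each of its synonyms, scaled by
        # their current similarity, so closer synonym pairs pull harder.
        for s in SYNONYMS.get(w, []):
            si = idx[s]
            weight = max(cosine(W[wi], W[si]), 0.0)
            W[wi] -= lr * lam * weight * (W[wi] - W[si])

print("cos(quick, fast) =", round(cosine(W[idx["quick"]], W[idx["fast"]]), 3))
```

Under this weighting, the synonym pull strengthens as co-occurrence training draws a pair together, rather than treating every thesaurus entry as equally reliable; the paper's SSM-W and SSM-M presumably realize such weighting through the two different strategies the abstract mentions.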
