首页> 外文会议>Web Information System and Application Conference >An Unsupervised Approach for Constructing Word Similarity Network
【24h】

An Unsupervised Approach for Constructing Word Similarity Network

机译:一种修建词相似度网络的无监督方法

获取原文

摘要

To evaluate how much a pair of entities or documents are similar is a common problem for current applications. Most approaches for this problem are based on the co-occurrence. However, different terms or words may represent the same entity or similar semantic in the real world since a concept often has more than one way of expression. Existing works always focus on computing semantic relatedness of words. But relatedness cannot reflect the similarity most of the time, on the other hand, most of their corpus are from common data sources such as Wikipedia and are not useful for the specialized vocabulary. In this paper, we propose a novel unsupervised approach for evaluating the semantic similarity between words by mapping texts to vector space and computing prior information. In our approach, we construct a model that can identify the words representing the same entity in special context even though they don't belong to the same concept. At last, we construct a network of words in which paths between words can reflect the evolution process of concepts. Our experimental results show that that our approach gives an effective solution to discover the semantic relationship between words, especially for words in specialty domains.
机译:为了评估一对实体或文档相似的数量是当前应用程序的常见问题。大多数用于这个问题的方法都是基于共同发生。然而,由于概念通常具有多种表达方式,不同的术语或单词可以在现实世界中表示相同的实体或类似的语义。现有的作品总是专注于计算单词的语义相关性。但是,相关性在大部分时间都无法反映相似性,另一方面,他们的大多数语料库来自常见的数据来源,例如维基百科,并且对专业词汇并不有用。在本文中,我们提出了一种通过将文本映射到向量空间和计算先前信息来评估单词之间的语义相似性的一种新颖的无监督方法。在我们的方法中,我们构建一个模型,即使它们不属于相同的概念,也可以识别代表特殊上下文中同一实体的单词。最后,我们构建一个单词网络,其中单词之间的路径可以反映概念的演变过程。我们的实验结果表明,我们的方法提供了一种有效的解决方案来发现词语之间的语义关系,特别是在特种域中的单词。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号