首页> 外文期刊>IEEE transactions on visualization and computer graphics >Visual Exploration of Semantic Relationships in Neural Word Embeddings
【24h】

Visual Exploration of Semantic Relationships in Neural Word Embeddings

机译:神经词嵌入中语义关系的视觉探索

获取原文
获取原文并翻译 | 示例
           

摘要

Constructing distributed representations for words through neural language models and using the resulting vector spaces for analysis has become a crucial component of natural language processing (NLP). However, despite their widespread application, little is known about the structure and properties of these spaces. To gain insights into the relationship between words, the NLP community has begun to adapt high-dimensional visualization techniques. In particular, researchers commonly use t-distributed stochastic neighbor embeddings (t-SNE) and principal component analysis (PCA) to create two-dimensional embeddings for assessing the overall structure and exploring linear relationships (e.g., word analogies), respectively. Unfortunately, these techniques often produce mediocre or even misleading results and cannot address domain-specific visualization challenges that are crucial for understanding semantic relationships in word embeddings. Here, we introduce new embedding techniques for visualizing semantic and syntactic analogies, and the corresponding tests to determine whether the resulting views capture salient structures. Additionally, we introduce two novel views for a comprehensive study of analogy relationships. Finally, we augment t-SNE embeddings to convey uncertainty information in order to allow a reliable interpretation. Combined, the different views address a number of domain-specific tasks difficult to solve with existing tools.
机译:通过神经语言模型构造单词的分布式表示形式,并使用所得的向量空间进行分析,已成为自然语言处理(NLP)的重要组成部分。然而,尽管它们得到了广泛的应用,但对这些空间的结构和性质知之甚少。为了深入了解单词之间的关系,NLP社区已开始采用高维可视​​化技术。特别地,研究人员通常使用t分布随机邻居嵌入(t-SNE)和主成分分析(PCA)来创建二维嵌入,以分别评估整体结构和探索线性关系(例如单词类比)。不幸的是,这些技术通常会产生中等甚至误导的结果,并且无法解决特定于领域的可视化挑战,这对于理解词嵌入中的语义关系至关重要。在这里,我们介绍了用于可视化语义和句法类比的新嵌入技术,以及用于确定结果视图是否捕获显着结构的相应测试。此外,我们介绍了两种新颖的观点来全面研究类比关系。最后,我们增加t-SNE嵌入来传达不确定性信息,以便进行可靠的解释。结合在一起,不同的视图解决了许多现有工具难以解决的特定领域任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号