首页> 外文期刊>IEEE transactions on visualization and computer graphics >Visual Exploration of Neural Document Embedding in Information Retrieval: Semantics and Feature Selection
【24h】

Visual Exploration of Neural Document Embedding in Information Retrieval: Semantics and Feature Selection

机译:信息检索中神经文档嵌入的视觉探索:语义和特征选择

获取原文
获取原文并翻译 | 示例
           

摘要

Neural embeddings are widely used in language modeling and feature generation with superior computational power. Particularly, neural document embedding - converting texts of variable-length to semantic vector representations - has shown to benefit widespread downstream applications, e.g., information retrieval (IR). However, the black-box nature makes it difficult to understand how the semantics are encoded and employed. We propose visual exploration of neural document embedding to gain insights into the underlying embedding space, and promote the utilization in prevalent IR applications. In this study, we take an IR application-driven view, which is further motivated by biomedical IR in healthcare decision-making, and collaborate with domain experts to design and develop a visual analytics system. This system visualizes neural document embeddings as a configurable document map and enables guidance and reasoning; facilitates to explore the neural embedding space and identify salient neural dimensions (semantic features) per task and domain interest; and supports advisable feature selection (semantic analysis) along with instant visual feedback to promote IR performance. We demonstrate the usefulness and effectiveness of this system and present inspiring findings in use cases. This work will help designers/developers of downstream applications gain insights and confidence in neural document embedding, and exploit that to achieve more favorable performance in application domains.
机译:神经嵌入以卓越的计算能力广泛用于语言建模和特征生成。特别地,神经文档嵌入(将可变长度的文本转换为语义向量表示)已经显示出有益于广泛的下游应用,例如信息检索(IR)。但是,黑盒性质使得难以理解语义是如何编码和采用的。我们建议对神经文档嵌入进行视觉探索,以深入了解潜在的嵌入空间,并促进在流行的IR应用程序中的利用。在这项研究中,我们采用了IR应用程序驱动的观点,该观点受到生物医学IR在医疗保健决策中的进一步推动,并与领域专家合作设计和开发视觉分析系统。该系统将神经文档嵌入可视化为可配置的文档图,并可以进行指导和推理;促进探索神经嵌入空间并根据任务和领域兴趣识别出显着的神经维度(语义特征);并支持建议的功能选择(语义分析)以及即时的视觉反馈,以提高IR性能。我们展示了该系统的有用性和有效性,并在用例中提出了令人鼓舞的发现。这项工作将帮助下游应用程序的设计者/开发人员获得对神经文档嵌入的见识和信心,并利用它们在应用程序领域中实现更出色的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号