首页> 外文会议>International conference on computational linguistics >Direct vs. indirect evaluation of distributional thesauri
【24h】

Direct vs. indirect evaluation of distributional thesauri

机译:分配叙词表的直接或间接评估

获取原文

摘要

With the success of word embedding methods in various Natural Language Processing tasks, all the fields of distributional semantics have experienced a renewed interest. Beside the famous word2vec, recent studies have presented efficient techniques to build distributional thesaurus; in particular, Claveau et al. (2014) have already shown that Information Retrieval (IR) tools and concepts can be successfully used to build a thesaurus. In this paper, we address the problem of the evaluation of such thesauri or embedding models. Several evaluation scenarue are considered: direct evaluation through reference lexicons and specially crafted datasets, and indirect evaluation through a third party tasks, namely lexical subsitution and Information Retrieval. For this latter task, we adopt the query expansion framework proposed by Claveau and Kijak (2016). Through several experiments, we first show that the recent techniques for building distributional thesaurus outperform the word2vec approach, whatever the evaluation scenario. We also highlight the differences between the evaluation scenarii, which may lead to very different conclusions when comparing distributional models. Last, we study the effect of some parameters of the distributional models on these various evaluation scenarii.
机译:随着单词嵌入方法在各种自然语言处理任务中的成功应用,分布语义学的所有领域都重新引起了人们的兴趣。除了著名的word2vec之外,最近的研究还提出了建立分布式同义词库的有效技术。特别是,Claveau等。 (2014年)已经表明,信息检索(IR)工具和概念可以成功地用于构建同义词库。在本文中,我们解决了对叙词表或嵌入模型进行评估的问题。考虑了几种评估方案:通过参考词典和特制数据集进行直接评估,以及通过第三方任务(即词汇替换和信息检索)进行间接评估。对于后一个任务,我们采用Claveau和Kijak(2016)提出的查询扩展框架。通过几个实验,我们首先表明,无论评估方案如何,用于构建分布式同义词库的最新技术都优于word2vec方法。我们还强调了评估场景之间的差异,这在比较分布模型时可能得出非常不同的结论。最后,我们研究了分布模型的某些参数对这些各种评估场景的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号