首页> 外文会议>Globalex Workshop on Linked Lexicography >Building Sense Representations in Danish by Combining Word Embeddings with Lexical Resources
【24h】

Building Sense Representations in Danish by Combining Word Embeddings with Lexical Resources

机译:通过将词嵌入与词汇资源相结合来建立丹麦语的语义表征

获取原文

摘要

Our aim is to identify suitable sense representations for NLP in Danish. We investigate sense inventories that correlate with human interpretations of word meaning and ambiguity as typically described in dictionaries and wordnets and that are well reflected distributionally as expressed in word embeddings. To this end, we study a number of highly ambiguous Danish nouns and examine the effectiveness of sense representations constructed by combining vectors from a distributional model with the information from a wordnet. We establish representations based on centroids obtained from wordnet synsets and example sentences as well as representations established via a clustering approach; these representations are tested in a word sense disambiguation task. We conclude that the more information extracted from the wordnet entries (example sentence, definition, semantic relations) the more successful the sense representation vector.
机译:我们的目的是为丹麦语中的NLP识别合适的感官表示形式。我们研究与人类对单词含义和歧义的解释相关的感觉清单,如字典和词网中通常描述的那样,并且可以很好地分布在词嵌入中,反映出来。为此,我们研究了许多高度歧义的丹麦名词,并研究了通过将分布模型中的向量与词网中的信息相结合而构造的有义表示的有效性。我们基于从词网同义词集和例句中获得的质心以及通过聚类方法建立的表示来建立表示;这些表述在词义消歧任务中进行了测试。我们得出结论,从词网条目(示例句子,定义,语义关系)中提取的信息越多,意义表示向量就越成功。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号