...
首页> 外文期刊>Information Sciences: An International Journal >Enhanced word embeddings using multi-semantic representation through lexical chains
【24h】

Enhanced word embeddings using multi-semantic representation through lexical chains

机译:通过词汇链使用多语义表示增强的单词嵌入

获取原文
获取原文并翻译 | 示例
           

摘要

The relationship between words in a sentence often tells us more about the underlying semantic content of a document than its actual words, individually. In this work, we propose two novel algorithms, called Flexible Lexical Chain II and Fixed Lexical Chain II. These algorithms combine the semantic relations derived from lexical chains, prior knowledge from lexical databases, and the robustness of the distributional hypothesis in word embeddings as building blocks forming a single system. In short, our approach has three main contributions: (i) a set of techniques that fully integrate word embeddings and lexical chains; (ii) a more robust semantic representation that considers the latent relation between words in a document; and (iii) lightweight word embeddings models that can be extended to any natural language task. We intend to assess the knowledge of pretrained models to evaluate their robustness in the document classification task. The proposed techniques are tested against seven word embeddings algorithms using five different machine learning classifiers over six scenarios in the document classification task. Our results show the integration between lexical chains and word embeddings representations sustain state-of-the-art results, even against more complex systems. (C) 2020 Elsevier Inc. All rights reserved.
机译:句子中单词之间的关系通常会告诉我们文档的底层语义内容,而不是单独的实际单词。在这项工作中,我们提出了两种新颖的算法,称为灵活的词汇链II和固定词汇链II。这些算法与词汇链中的语义关系,从词汇数据库中的先验知识,以及Word Embeddings中分布假设的鲁棒性作为形成单个系统的构建块。简而言之,我们的方法有三个主要贡献:(i)一套完全集成了单词嵌入和词汇链的技术; (ii)一种更强大的语义表示,它考虑了文档中的单词之间的潜在关系; (iii)轻量级单词嵌入式模型,可以扩展到任何自然语言任务。我们打算评估预借鉴模型的知识,以评估文档分类任务中的鲁棒性。在文档分类任务中使用五个不同的机器学习分类器,在六个不同的机器学习分类器中测试了七个单词嵌入算法的测试。我们的结果显示了词汇链和Word Embeddings表示之间的集成维持最先进的结果,即使针对更复杂的系统。 (c)2020 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号