...
首页> 外文期刊>Journal of biomedical informatics. >Identifying synonymy between relational phrases using word embeddings
【24h】

Identifying synonymy between relational phrases using word embeddings

机译:使用词嵌入识别关系短语之间的同义词

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Many text mining applications in the biomedical domain benefit from automatic clustering of relational phrases into synonymous groups, since it alleviates the problem of spurious mismatches caused by the diversity of natural language expressions. Most of the previous work that has addressed this task of synonymy resolution uses similarity metrics between relational phrases based on textual strings or dependency paths, which, for the most part, ignore the context around the relations. To overcome this shortcoming, we employ a word embedding technique to encode relational phrases. We then apply the k-means algorithm on top of the distributional representations to cluster the phrases. Our experimental results show that this approach outperforms state-of-the-art statistical models including latent Dirichlet allocation and Markov logic networks. (C) 2015 Elsevier Inc. All rights reserved.
机译:生物医学领域中的许多文本挖掘应用程序都受益于将关系短语自动聚类为同义词组,因为它缓解了由自然语言表达的多样性引起的虚假不匹配的问题。解决同义解析任务的大多数以前的工作都使用基于文本字符串或依赖路径的关系短语之间的相似性度量,这些度量在大多数情况下会忽略关系周围的上下文。为了克服此缺点,我们采用词嵌入技术对关系短语进行编码。然后,我们在分布表示的顶部应用k-means算法对短语进行聚类。我们的实验结果表明,该方法优于包括潜在Dirichlet分配和Markov逻辑网络在内的最新统计模型。 (C)2015 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号