首页> 外文期刊>Journal of the American Society for Information Science >SemGraph: Extracting Keyphrases Following a Novel Semantic Graph-Based Approach
【24h】

SemGraph: Extracting Keyphrases Following a Novel Semantic Graph-Based Approach

机译:SemGraph:根据一种新颖的基于语义图的方法提取关键短语

获取原文
获取原文并翻译 | 示例
           

摘要

Keyphrases represent the main topics a text is about. In this article, we introduce SemGraph, an unsupervised algorithm for extracting keyphrases from a collection of texts based on a semantic relationship graph. The main novelty of this algorithm is its ability to identify semantic relationships between words whose presence is statistically significant. Our method constructs a co-occurrence graph in which words appearing in the same document are linked, provided their presence in the collection is statistically significant with respect to a null model. Furthermore, the graph obtained is enriched with information from WordNet. We have used the most recent and standardized benchmark to evaluate the system ability to detect the keyphrases that are part of the text. The result is a method that achieves an improvement of 5.3% and 7.28% in F measure over the two labeled sets of keyphrases used in the evaluation of SemEval-2010.
机译:关键短语代表文本的主要主题。在本文中,我们介绍了SemGraph,这是一种基于语义关系图从文本集合中提取关键短语的无监督算法。该算法的主要新颖之处在于它能够识别存在于统计上的单词之间的语义关系。我们的方法构建一个共现图,其中出现在同一文档中的单词被链接起来,只要它们在集合中的存在相对于null模型在统计上是有意义的。此外,从WordNet获得的图形丰富了信息。我们已经使用了最新的标准化基准来评估系统检测文本中的关键短语的能力。结果是,与在SemEval-2010评估中使用的两个标记的关键短语集相比,F度量的改进方法分别达到5.3%和7.28%。

著录项

  • 来源
  • 作者单位

    NLP & IR Group, Dpto. Lenguajes y Sistemas Informaticos, Universidad Nacional de Educacion a Distancia (UNED), Juan del Rosal, 16. 28040 Madrid, Spain;

    NLP & IR Group, Dpto. Lenguajes y Sistemas Informaticos, Universidad Nacional de Educacion a Distancia (UNED), Juan del Rosal, 16. 28040 Madrid, Spain;

    NLP & IR Group, Dpto. Lenguajes y Sistemas Informaticos, Universidad Nacional de Educacion a Distancia (UNED), Juan del Rosal, 16. 28040 Madrid, Spain;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号