...
首页> 外文期刊>Expert systems with applications >iSpreadRank: Ranking sentences for extraction-based summarization using feature weight propagation in the sentence similarity network
【24h】

iSpreadRank: Ranking sentences for extraction-based summarization using feature weight propagation in the sentence similarity network

机译:iSpreadRank:在句子相似性网络中使用特征权重传播对句子进行排序,以进行基于提取的摘要

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Sentence extraction is a widely adopted text summarization technique where the most important sentences are extracted from docu-ment(s) and presented as a summary. The first step towards sentence extraction is to rank sentences in order of importance as in the summary. This paper proposes a novel graph-based ranking method, iSpreadRank, to perform this task. iSpreadRank models a set of topic-related documents into a sentence similarity network. Based on such a network model, iSpreadRank exploits the spreading activation theory to formulate a general concept from social network analysis: the importance of a node in a network (i.e., a sentence in this paper) is determined not only by the number of nodes to which it connects, but also by the importance of its connected nodes. The algorithm recursively re-weights the importance of sentences by spreading their sentence-specific feature scores throughout the network to adjust the importance of other sentences. Consequently, a ranking of sentences indicating the relative importance of sentences is reasoned. This paper also develops an approach to produce a generic extractive summary according to the inferred sentence ranking. The proposed summarization method is evaluated using the DUC 2004 data set, and found to perform well. Experimental results show that the proposed method obtains a ROUGE-1 score of 0.38068, which represents a slight difference of 0.00156, when compared with the best participant in the DUC 2004 evaluation.
机译:句子提取是一种广泛采用的文本摘要技术,其中,最重要的句子是从文档中提取出来的,并作为摘要呈现。句子提取的第一步是按照摘要中的顺序对句子进行排名。本文提出了一种新颖的基于图的排名方法iSpreadRank来执行此任务。 iSpreadRank将一组与主题相关的文档建模到句子相似性网络中。基于这样的网络模型,iSpreadRank利用传播激活理论从社交网络分析中制定了一个一般概念:网络中节点的重要性(即本文中的句子)不仅取决于要连接的节点数,还取决于节点的数量。它所连接的节点,还取决于其所连接节点的重要性。该算法通过在整个网络中扩展其特定于句子的特征评分来调整其他句子的重要性,从而对句子的重要性进行递归重新加权。因此,对指示句子的相对重要性的句子的等级进行推理。本文还提出了一种根据推断的句子排名产生通用提取摘要的方法。使用DUC 2004数据集对提出的汇总方法进行了评估,发现该方法表现良好。实验结果表明,与DUC 2004评估中的最佳参与者相比,该方法的ROUGE-1得分为0.38068,相差0.00156。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号