SRL-ESA-TextSum: A text summarization approach based on semantic role labeling and explicit semantic analysis

Mohamed Muhidin; Oussalah Mourad

首页> 外文期刊>Information Processing & Management >SRL-ESA-TextSum: A text summarization approach based on semantic role labeling and explicit semantic analysis

【24h】

SRL-ESA-TextSum: A text summarization approach based on semantic role labeling and explicit semantic analysis

机译：SRL-ESA-TextSum：一种基于语义角色标记和显式语义分析的文本汇总方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic text summarization attempts to provide an effective solution to today's unprecedented growth of textual data. This paper proposes an innovative graph-based text summarization framework for generic single and multi document summarization. The summarizer benefits from two well-established text semantic representation techniques; Semantic Role Labelling (SRL) and Explicit Semantic Analysis (ESA) as well as the constantly evolving collective human knowledge in Wikipedia. The SRL is used to achieve sentence semantic parsing whose word tokens are represented as a vector of weighted Wikipedia concepts using ESA method. The essence of the developed framework is to construct a unique concept graph representation underpinned by semantic role-based multi-node (under sentence level) vertices for summarization. We have empirically evaluated the summarization system using the standard publicly available dataset from Document Understanding Conference 2002 (DUC 2002). Experimental results indicate that the proposed summarizer outperforms all state-of-the-art related comparators in the single document summarization based on the ROUGE-1 and ROUGE-2 measures, while also ranking second in the ROUGE-1 and ROUGE-SU4 scores for the multi-document summarization. On the other hand, the testing also demonstrates the scalability of the system, i.e., varying the evaluation data size is shown to have little impact on the summarizer performance, particularly for the single document summarization task. In a nutshell, the findings demonstrate the power of the role-based and vectorial semantic representation when combined with the crowd-sourced knowledge base in Wikipedia.

机译：自动文本摘要试图为当今文本数据的空前增长提供有效的解决方案。本文提出了一种创新的基于图的文本摘要框架，用于通用的单文档和多文档摘要。摘要器受益于两种完善的文本语义表示技术；语义角色标记（SRL）和显式语义分析（ESA）以及维基百科中不断发展的集体人类知识。 SRL用于使用ESA方法实现句子语义解析，其单词标记表示为加权Wikipedia概念的向量。开发框架的本质是构造一个独特的概念图表示，以基于语义角色的多节点（句子级别）顶点为基础进行概括。我们使用2002年文档理解大会（DUC 2002）的标准公共可用数据集对经验总结系统进行了评估。实验结果表明，在基于ROUGE-1和ROUGE-2度量的单个文档汇总中，拟议的汇总器的性能优于所有最新的比较器，同时在ROUGE-1和ROUGE-SU4评分中排名第二。多文档摘要。另一方面，测试还证明了系统的可伸缩性，即，显示评估数据大小的变化对汇总器的性能影响很小，特别是对于单个文档汇总任务而言。简而言之，这些发现证明了当与Wikipedia中的众包知识库结合时，基于角色和矢量语义表示的功能。

著录项

来源
《Information Processing & Management》 |2019年第4期|1356-1372|共17页
作者
Mohamed Muhidin; Oussalah Mourad;
展开▼
作者单位

Aston Univ, Sch Engn & Appl Sci, Comp Sci, Birmingham B4 7ET, W Midlands, England;

Univ Oulu, Fac Informat Technol Comp Sci, Ctr Ubiquitous Comp, POB 4500, Oulu 90014, Finland;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Text summarization; Semantic role labeling; Wikipedia concepts; Concept graphs; Semantic similarity; Iterative ranking algorithm;

机译：文本摘要;语义角色标签;维基百科概念;概念图;语义相似度;迭代排序算法;

相似文献

外文文献
中文文献
专利

1. SRL-ESA-TextSum: A text summarization approach based on semantic role labeling and explicit semantic analysis [J] . Mohamed Muhidin, Oussalah Mourad Information Processing & Management . 2019,第4期

机译：SRL-esa-textsum：一种基于语义角色标记和显式语义分析的文本摘要方法
2. SRL-GSM: A Hybrid Approach based on Semantic Role Labeling and General Statistic Method for Text Summarization [J] . L. Suanmali, N. Salim, M.S. Binwahlan Journal of Applied Sciences . 2010,第3期

机译：SRL-GSM：一种基于语义角色标记的混合方法和文本摘要的一般统计方法
3. Towards patent text analysis based on semantic role labelling [J] . Yanqing He, Ying Li, Lingen Meng, International Journal of Computational Science and Engineering . 2017,第3a4期

机译：基于语义角色标记的专利文本分析
4. An Iterative Graph-Based Generic Single and Multi Document Summarization Approach Using Semantic Role Labeling and Wikipedia Concepts [C] . Muhidin Mohamed, Mourad Oussalah . 2016

机译：基于迭代图的语义角色标签和维基百科概念的通用单文档和多文档摘要方法
5. A CCG-Based Method for Training a Semantic Role Labeler in the Absence of Explicit Syntactic Training Data. [D] . Boxwell, Stephen A. 2011

机译：在没有显式句法训练数据的情况下，基于CCG的方法来训练语义角色标签。
6. A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method [O] . Illhoi Yoo, Xiaohua Hu, Il-Yeol Song 2007

机译：基于相干图的生物医学文献语义聚类和总结方法及新的评价方法
7. SRL-ESA-TextSum: A text summarization approach based on semantic role labeling and explicit semantic analysis [O] . Muhidin Mohamed, Mourad Oussalah 2019

机译：SRL-esa-textsum：一种基于语义角色标记和显式语义分析的文本摘要方法

SRL-ESA-TextSum: A text summarization approach based on semantic role labeling and explicit semantic analysis

摘要

著录项

相似文献

相关主题

期刊订阅