Mining Graph-based Features in Multi-objective Framework for Microblog Summarization

机译：多目标框架中基于图的特征的微博总结

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Nowadays, micro-blogging sites are getting popular due to the involvement of a large number of users. In the case of natural disasters, a significant amount of relevant information (giving crucial information) are present amongst the tweets. Therefore, there is a need to develop a system that summarizes relevant tweets by extracting informative tweets. In the current paper, we have proposed an unsupervised approach for summarizing the relevant tweets namely, MOOTweetSumm+, which automatically selects the informative tweets. Several tweet-scoring measures: (a) anti-redundancy measuring the dissimilarity between tweets; (b) similarity with outputs provided by LexRank (a graph-based method measuring tweet importance based on the concept of eigen-vector centrality in a graph); (c) BM25 based ranking function; (d) tf-idf based ranking function; (e) length of the tweet; (f) re-tweet count, are simultaneously optimized utilizing a binary differential evolution algorithm. Further, two different versions of the LexRank, utilizing syntactic and semantic similarity, have also been explored. For evaluation, four different disaster-event related datasets are used, and performance is measured in terms of ROUGE scores. An ablation study is also performed to determine which set of measures is best suited for different datasets. From the results obtained, it is clearly evident that our approach improves by 13.2% and 5.8% in terms of ROUGE-2 and ROUGE-L scores, over the existing approaches, respectively.

机译：如今，由于大量用户的参与，微博客网站变得越来越流行。在自然灾害的情况下，推文中存在大量相关信息（提供关键信息）。因此，需要开发一种通过提取信息性推文来总结相关推文的系统。在本文中，我们提出了一种无监督的方法来汇总相关的推文，即MOOTweetSumm +，它会自动选择信息性的推文。几种推文评分措施：（a）反冗余，衡量推文之间的差异; （b）与LexRank提供的输出相似（基于图的特征向量中心性概念，基于图的方法来衡量推文的重要性）; （c）基于BM25的排名功能; （d）基于tf-idf的排名功能; （e）推文的长度; （f）重推计数，同时利用二进制差分进化算法进行优化。此外，还研究了利用语法和语义相似性的LexRank的两个不同版本。为了进行评估，使用了四个与灾难事件相关的不同数据集，并根据ROUGE分数衡量了性能。还进行了消融研究，以确定最适合于不同数据集的哪一组量度。从获得的结果来看，很明显，与现有方法相比，我们的方法在ROUGE-2和ROUGE-L评分方面分别提高了13.2％和5.8％。

著录项

来源
《IEEE Congress on Evolutionary Computation》|2020年|1-8|共8页
会议地点
作者
Naveen Saini; Sushil Kumar; Sriparna Saha; Pushpak Bhattacharyya;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Microblog; disaster-event; extractive summarization; multi-objective optimization; evolutionary algorithm; LexRank.;

机译：微博;灾害事件;摘要性提取;多目标优化;进化算法; LexRank;

相似文献

外文文献
中文文献
专利

1. Search engine reinforced semi-supervised classification and graph-based summarization of microblogs [J] . Yan Chen, Xiaoming Zhang, Zhoujun Li, Neurocomputing . 2015,第mara25期

机译：搜索引擎增强的半监督分类和基于图的微博摘要
2. EdgeSumm: Graph-based framework for automatic text summarization [J] . Wafaa S. El-Kassas, Cherif R. Salama, Ahmed A. Rafea, Information Processing & Management . 2020,第6期

机译：EdgeMumm：基于图形的自动文本摘要框架
3. Improving Graph-Based Summarization with HTML Tag and Metadata Features [J] . Dewi Wardani, Yuni Susanti Engineering Letters . 2020,第2期

机译：使用HTML标记和元数据功能提高基于图形的摘要
4. A time series clustering based framework for multimedia mining and summarization using audio features [C] . Regunathan Radhakrishnan, Ajay Divakaran, Ziyou Xiong ACM SIGMM international workshop on Multimedia information retrieval . 2004

机译：基于时间序列聚类的框架，用于使用音频功能进行多媒体挖掘和汇总
5. Realizing a feature-based framework for scientific data mining. [D] . Mehta, Sameep. 2006

机译：实现基于特征的科学数据挖掘框架。
6. A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method [O] . Illhoi Yoo, Xiaohua Hu, Il-Yeol Song 2007

机译：基于相干图的生物医学文献语义聚类和总结方法及新的评价方法
7. A Graph-Based Framework for Web Document Mining [O] . Adam Schenker, Horst Bunke, Mark Last, 2004

机译：基于图形文件挖掘的图形框架
8. PKUICST at TREC 2014 Microblog Track: Feature Extraction for Effective Microblog Search and Adaptive Clustering Algorithms for TTG. [R] . Lv, C., Fan, F., Qiang, R., 2014

机译：2014年TREC上的pKUICsT微博跟踪：TTG的有效微博搜索和自适应聚类算法的特征提取。

Mining Graph-based Features in Multi-objective Framework for Microblog Summarization

摘要

著录项

相似文献

相关主题

期刊订阅