首页> 外文会议>IEEE Congress on Evolutionary Computation >Mining Graph-based Features in Multi-objective Framework for Microblog Summarization
【24h】

Mining Graph-based Features in Multi-objective Framework for Microblog Summarization

机译:多目标框架中基于图的特征的微博总结

获取原文

摘要

Nowadays, micro-blogging sites are getting popular due to the involvement of a large number of users. In the case of natural disasters, a significant amount of relevant information (giving crucial information) are present amongst the tweets. Therefore, there is a need to develop a system that summarizes relevant tweets by extracting informative tweets. In the current paper, we have proposed an unsupervised approach for summarizing the relevant tweets namely, MOOTweetSumm+, which automatically selects the informative tweets. Several tweet-scoring measures: (a) anti-redundancy measuring the dissimilarity between tweets; (b) similarity with outputs provided by LexRank (a graph-based method measuring tweet importance based on the concept of eigen-vector centrality in a graph); (c) BM25 based ranking function; (d) tf-idf based ranking function; (e) length of the tweet; (f) re-tweet count, are simultaneously optimized utilizing a binary differential evolution algorithm. Further, two different versions of the LexRank, utilizing syntactic and semantic similarity, have also been explored. For evaluation, four different disaster-event related datasets are used, and performance is measured in terms of ROUGE scores. An ablation study is also performed to determine which set of measures is best suited for different datasets. From the results obtained, it is clearly evident that our approach improves by 13.2% and 5.8% in terms of ROUGE-2 and ROUGE-L scores, over the existing approaches, respectively.
机译:如今,由于大量用户的参与,微博客网站变得越来越流行。在自然灾害的情况下,推文中存在大量相关信息(提供关键信息)。因此,需要开发一种通过提取信息性推文来总结相关推文的系统。在本文中,我们提出了一种无监督的方法来汇总相关的推文,即MOOTweetSumm +,它会自动选择信息性的推文。几种推文评分措施:(a)反冗余,衡量推文之间的差异; (b)与LexRank提供的输出相似(基于图的特征向量中心性概念,基于图的方法来衡量推文的重要性); (c)基于BM25的排名功能; (d)基于tf-idf的排名功能; (e)推文的长度; (f)重推计数,同时利用二进制差分进化算法进行优化。此外,还研究了利用语法和语义相似性的LexRank的两个不同版本。为了进行评估,使用了四个与灾难事件相关的不同数据集,并根据ROUGE分数衡量了性能。还进行了消融研究,以确定最适合于不同数据集的哪一组量度。从获得的结果来看,很明显,与现有方法相比,我们的方法在ROUGE-2和ROUGE-L评分方面分别提高了13.2%和5.8%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号