Krimping texts for better summarization

机译：拼写文本以进行更好的总结

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automated text summarization is aimed at extracting essential information from original text and presenting it in a minimal, often predefined, number of words. In this paper, we introduce a new approach for unsupervised extractive summarization, based on the Minimum Description Length (MDL) principle, using the Krimp dataset compression algorithm (Vreeken et al., 2011). Our approach represents a text as a transactional dataset, with sentences as transactions, and then describes it by itemsets that stand for frequent sequences of words. The summary is then compiled from sentences that compress (and as such, best describe) the document. The problem of summarization is reduced to the maximal coverage, following the assumption that a summary that best describes the original text, should cover most of the word sequences describing the document. We solve it by a greedy algorithm and present the evaluation results.

机译：自动文本摘要旨在从原始文本中提取基本信息并将其呈现在最小，通常预定义的单词中。在本文中，我们使用KRIMP DataSet压缩算法（Vreeken等，2011），介绍了一种用于无监督的提取总结的新方法（MDL）原则（Vreeken等，2011）。我们的方法表示作为事务数据集的文本，其中句子作为事务，然后通过代表频繁的单词序列的项目集来描述它。然后将摘要从压缩（以及最佳描述）文档中的句子编译。在最能描述原始文本的摘要之后，概述的问题减少到最大覆盖范围，应该涵盖描述文档的大多数单词序列。我们通过贪婪的算法解决并提出评估结果。

著录项

来源
《Conference on empirical methods in natural language processing》|2015年|1931-1935|共5页
会议地点
作者
Marina Litvak; Natalia Vanetik; Mark Last;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Text Summarization Challenge 2 Text summarization evaluation at NTCIR Workshop 3 [J] . Manabu Okumura, Takahiro Fukusima, Hidetsugu Nanba, ACM SIGIR FORUM . 2004,第1期

机译：文字摘要挑战2 NTCIR研讨会3的文字摘要评估
2. ArA*summarizer: An Arabic text summarization system based on subtopic segmentation and using an A* algorithm for reduction [J] . Expert Systems . 2020,第2期

机译：ArA * summarizer：基于子主题分段并使用A *算法进行归约的阿拉伯文本摘要系统
3. Karci summarization: A simple and effective approach for automatic text summarization using Karci entropy [J] . Cengiz Hark, Ali Karci Information Processing & Management . 2020,第3期

机译：Karci摘要：使用Karci熵进行文本自动摘要的一种简单有效的方法
4. Krimping texts for better summarization [C] . Marina Litvak, Natalia Vanetik, Mark Last Conference on empirical methods in natural language processing . 2015

机译：Krimping文本以更好的摘要
5. A Hierarchical Extractive Text Summarization Approach [D] . Alshahrani, Saud Shari. 2021

机译：分层提取文本摘要方法
6. Towards Answering Biological Questions with Experimental Evidence: Automatically Identifying Text that Summarize Image Content in Full-Text Articles [O] . Hong Yu 2006

机译：尝试用实验证据回答生物学问题：自动识别全文文章中包含图像内容的文本
7. Krimping texts for better summarization [O] . Marina Litvak, Natalia Vanetik, Mark Last 2015

机译：Krimping文本以便更好地总结

Krimping texts for better summarization

摘要

著录项

相似文献

相关主题

期刊订阅