Conference: Language generation and evaluation workshop 2011

A New Sentence Compression Dataset and Its Use in an Abstractive Generate-and-Rank Sentence Compressor

Abstract

Sentence compression has attracted much interest in recent years, but most sentence compressors are extractive, i.e., they only delete words. There is a lack of appropriate datasets to train and evaluate abstractive sentence compressors, i.e., methods that apart from deleting words can also rephrase expressions. We present a new dataset that contains candidate extractive and abstractive compressions of source sentences. The candidate compressions are annotated with human judgements for grammaticality and meaning preservation. We discuss how the dataset was created, and how it can be used in generate-and-rank abstractive sentence compressors. We also report experimental results with a novel abstractive sentence compressor that uses the dataset.
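The generate-and-rank idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' system: the function names, the toy paraphrase table, the 0-to-1 quality scores, and the weighted scoring formula are all assumptions made for illustration. Candidates are produced by deleting words (extractive) and by swapping a phrase for a shorter one (abstractive), then ranked by a caller-supplied quality function, which stands in for a model trained on human judgements of grammaticality and meaning preservation.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Candidate:
    text: str
    score: float


def generate_candidates(source, paraphrases, max_deletions=1):
    """Return extractive (deletion) and abstractive (rephrasing) candidates."""
    words = source.split()
    candidates = set()
    # Extractive step: drop up to max_deletions words from the source.
    for k in range(1, max_deletions + 1):
        for drop in combinations(range(len(words)), k):
            kept = [w for i, w in enumerate(words) if i not in drop]
            candidates.add(" ".join(kept))
    # Abstractive step: swap a phrase for a shorter paraphrase, both in the
    # source and in every extractive candidate built above.
    for text in [source, *sorted(candidates)]:
        padded = f" {text} "
        for phrase, shorter in paraphrases.items():
            if f" {phrase} " in padded:
                rewritten = padded.replace(f" {phrase} ", f" {shorter} ")
                candidates.add(rewritten.strip())
    candidates.discard(source)
    candidates.discard("")
    return sorted(candidates)


def rank(source, candidates, quality, weight=0.5):
    """Rank candidates by a weighted mix of quality and compression.

    quality(source, candidate) is a hypothetical scorer returning a value
    in [0, 1]; compression is measured in characters.
    """
    ranked = []
    for text in candidates:
        compression = 1 - len(text) / len(source)
        score = weight * quality(source, text) + (1 - weight) * compression
        ranked.append(Candidate(text, score))
    return sorted(ranked, key=lambda c: c.score, reverse=True)


# Toy data, invented for this example only.
source = "the committee made a decision to delay the vote"
paraphrases = {"made a decision": "decided"}
judgements = {
    "the committee decided to delay the vote": 0.9,
    "committee decided to delay the vote": 0.8,
}

candidates = generate_candidates(source, paraphrases)
ranked = rank(source, candidates, lambda s, c: judgements.get(c, 0.2), weight=0.7)
print(ranked[0].text)
```

With these invented scores, the rephrased candidate outranks the pure deletions because its quality score outweighs their small gain in compression. In a real setting the lookup table would be replaced by a learned scorer.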

Bibliographic details

  • Conference location: Edinburgh (GB)
  • Author affiliations

    Department of Informatics, Athens University of Economics and Business, Greece;

    Department of Informatics, Athens University of Economics and Business, Greece, Digital Curation Unit - IMIS, Research Center 'Athena', Greece;

  • Original format: PDF
  • Language of text: English
