首页> 外文期刊>IEICE Transactions on Information and Systems >Speech Summarization: An Approach through Word Extraction and a Method for Evaluation
【24h】

Speech Summarization: An Approach through Word Extraction and a Method for Evaluation

机译:语音摘要:一种通过单词提取的方法和一种评估方法

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we propose a new method of automatic speech summarization for each utterance, where a set of words that maximizes a summarization score is extracted from automatic speech transcriptions. The summarization score indicates the appropriateness of summarized sentences. This extraction is achieved by using a dynamic programming technique according to a target summarization ratio. This ratio is the number of characters/words in the summarized sentence divided by the number of characters/words in the original sentence. The extracted set of words is then connected to build a summarized sentence. The summarization score consists of a word significance measure, linguistic likelihood, and a confidence measure. This paper also proposes a new method of measuring summarization accuracy based on a word network expressing manual summarization results. The summarization accuracy of each automatic summarization is calculated by comparing it with the most similar word string in the network. Japanese broadcast-news speech, transcribed using a large-vocabulary continuous-speech recognition (LVCSR) system, is summarized and evaluated using our proposed method with 20, 40, 60, 70 and 80% summarization ratios. Experimental results reveal that the proposed method can effectively extract relatively important information by removing redundant or irrelevant information.
机译:在本文中,我们提出了一种针对每种话语的自动语音摘要的新方法,其中从自动语音转录中提取最大化摘要分数的一组单词。总结分数表明总结句子的适当性。通过使用动态编程技术根据目标摘要比率实现此提取。该比率是摘要句子中的字符/单词数除以原始句子中的字符/单词数。然后将提取出的一组单词连接起来以构建一个概括的句子。总结分数包括单词重要性度量,语言可能性和置信度度量。本文还提出了一种基于表示人工汇总结果的单词网络的新的汇总精度测量方法。通过将其与网络中最相似的字串进行比较,可以计算出每次自动汇总的汇总精度。使用大词汇量连续语音识别(LVCSR)系统转录的日本广播新闻语音,使用我们提出的方法以20、40、60、70和80%的摘要率进行总结和评估。实验结果表明,该方法可以去除多余或不相关的信息,从而有效地提取相对重要的信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号