首页> 外文会议>International Conference on Computer Science and Network Technology >Experimental comparison of text information based punctuation recovery algorithms in real data
【24h】

Experimental comparison of text information based punctuation recovery algorithms in real data

机译:基于文本信息的标点恢复算法在真实数据中的实验比较

获取原文

摘要

Punctuation recovery is very important for automatic speech recognition (ASR). It greatly improves readability of transcripts and user experience, and facilitates following natural language processing tasks. The text information based method is one of the basic solutions of punctuation recovery. For analyzing the features of these algorithms, improving them and using them to develop practical system, this paper evaluates text information based punctuation recovery algorithms (HELM, CRF, RNNLM and GTI) in real data. Results show that GTI outperforms other algorithms for punctuation recovery, and ASR's error is the main cause of performance degradation of all punctuation recovery algorithms. Finally, some suggestions are given.
机译:标点恢复对于自动语音识别(ASR)非常重要。它极大地提高了成绩单的可读性和用户体验,并有助于遵循自然语言处理任务。基于文本信息的方法是标点符号恢复的基本解决方案之一。为了分析这些算法的特性,对其进行改进并用于开发实用系统,本文对真实数据中基于文本信息的标点恢复算法(HELM,CRF,RNNLM和GTI)进行了评估。结果表明,GTI的性能优于其他标点恢复算法,而ASR的错误是所有标点恢复算法性能下降的主要原因。最后,给出了一些建议。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号