首页> 外文会议>SLSP 2013 >An Investigation of Single-Pass ASR System Combination for Spoken Language Understanding
【24h】

An Investigation of Single-Pass ASR System Combination for Spoken Language Understanding

机译:对口语理解单通ASR系统组合的调查

获取原文

摘要

This paper studies the benefits provided by a single-pass Automatic Speech Recognition (ASR) exchange-based combination approach for spoken dialog system. Three famous open-source ASR systems are used to experiment this approach in the framework of Spoken Language Understanding (SLU). On the ASR side, single-pass ASR systems are used with an online acoustic model adaptation using the previous utterances said by a speaker. On the SLU side, a competitive CRF-based SLU system is applied on outputs of ASR system to obtain the semantic concepts. The evaluation is done on the French PORT-MEDIA test data in terms of both Word Error Rate (WER) and Concept Error Rate (CER). While the best single pass system used alone shows a CER of 29.8% for a WER of 22.8%, single-pass ASR exchange-based combination reaches a CER of 27.3% for a WER of 26%. This CER is only slightly higher than the one reached by a 5-passes ASR system which obtained a CER of 26.8% for a WER of 22.8% in better conditions, i.e. better acoustic model adaptation made on all the speech utterances said by a speaker, advanced feature extraction techniques and search graph rescoring using language model with higher order.
机译:本文研究了单通式自动语音识别(ASR)基于互联的对话系统的组合方法提供的益处。三种着名的开源ASR系统用于在口语理解框架中尝试这种方法(SLU)。在ASR侧,使用扬声器表示,单通ASR系统与在线声学模型适配一起使用。在SLU侧,应用基于CRF的SLU系统,应用于ASR系统的输出以获得语义概念。根据字错误率(WER)和概念错误率(CER),在法国端口媒体测试数据上进行评估。虽然单独使用的最佳单通系统显示22.8%的22.8%的CER,但单通ASR交换基组合达到27.3%的27.3%,为26%。该CER仅略高于5级通过ASR系统达到的,在更好的条件下获得22.8%的22.8%的CER,即在所有语音话语上制作的更好的声学模型适应,高级功能提取技术和搜索图使用具有更高阶的语言模型进行繁殖。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号