首页> 外文会议>2018 IEEE Spoken Language Technology Workshop >Dialog-Context Aware end-to-end Speech Recognition
【24h】

Dialog-Context Aware end-to-end Speech Recognition

机译:对话上下文感知端到端语音识别

获取原文
获取原文并翻译 | 示例

摘要

Existing speech recognition systems are typically built at the sentence level, although it is known that dialog context, e.g. higher-level knowledge that spans across sentences or speakers, can help the processing of long conversations. The recent progress in end-to-end speech recognition systems promises to integrate all available information (e.g. acoustic, language resources) into a single model, which is then jointly optimized. It seems natural that such dialog context information should thus also be integrated into the end-to-end models to improve recognition accuracy further. In this work, we present a dialog-context aware speech recognition model, which explicitly uses context information beyond sentence-level information, in an end-to-end fashion. Our dialog-context model captures a history of sentence-level contexts, so that the whole system can be trained with dialog-context information in an end-to-end manner. We evaluate our proposed approach on the Switchboard conversational speech corpus, and show that our system outperforms a comparable sentence-level end-to-end speech recognition system.
机译:现有的语音识别系统通常建立在句子级别,尽管已知对话上下文,例如语音对话。跨越句子或说话者的高级知识可以帮助处理长时间的对话。端到端语音识别系统的最新进展有望将所有可用信息(例如声学,语言资源)集成到单个模型中,然后对其进行联合优化。因此,这样的对话上下文信息也应该集成到端到端模型中以进一步提高识别准确性,这似乎是很自然的。在这项工作中,我们提出了一个对话上下文感知的语音识别模型,该模型以端到端的方式显式地使用句子信息之外的上下文信息。我们的对话上下文模型捕获了句子级上下文的历史,因此整个系统可以端到端的方式使用对话上下文信息进行训练。我们评估了在Switchboard会话语音语料库上提出的方法,并表明我们的系统优于可比的句子级端到端语音识别系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号