Dialog-Context Aware end-to-end Speech Recognition

机译：对话上下文感知端到端语音识别

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Existing speech recognition systems are typically built at the sentence level, although it is known that dialog context, e.g. higher-level knowledge that spans across sentences or speakers, can help the processing of long conversations. The recent progress in end-to-end speech recognition systems promises to integrate all available information (e.g. acoustic, language resources) into a single model, which is then jointly optimized. It seems natural that such dialog context information should thus also be integrated into the end-to-end models to improve recognition accuracy further. In this work, we present a dialog-context aware speech recognition model, which explicitly uses context information beyond sentence-level information, in an end-to-end fashion. Our dialog-context model captures a history of sentence-level contexts, so that the whole system can be trained with dialog-context information in an end-to-end manner. We evaluate our proposed approach on the Switchboard conversational speech corpus, and show that our system outperforms a comparable sentence-level end-to-end speech recognition system.

机译：现有的语音识别系统通常建立在句子级别，尽管已知对话上下文，例如语音对话。跨越句子或说话者的高级知识可以帮助处理长时间的对话。端到端语音识别系统的最新进展有望将所有可用信息（例如声学，语言资源）集成到单个模型中，然后对其进行联合优化。因此，这样的对话上下文信息也应该集成到端到端模型中以进一步提高识别准确性，这似乎是很自然的。在这项工作中，我们提出了一个对话上下文感知的语音识别模型，该模型以端到端的方式显式地使用句子信息之外的上下文信息。我们的对话上下文模型捕获了句子级上下文的历史，因此整个系统可以端到端的方式使用对话上下文信息进行训练。我们评估了在Switchboard会话语音语料库上提出的方法，并表明我们的系统优于可比的句子级端到端语音识别系统。

著录项

来源
《2018 IEEE Spoken Language Technology Workshop》|2018年|434-440|共7页
会议地点 Athens(GR)
作者
Suyoun Kim; Florian Metze;
展开▼
作者单位

Electrical Computer Engineering, Carnegie Mellon University;

Language Technologies Institute, Carnegie Mellon University;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Speech recognition; Context modeling; Decoding; Training; Data models; Computational modeling; Switches;

机译：语音识别;上下文建模;解码;训练;数据模型;计算建模;开关;;

相似文献

外文文献
中文文献
专利

1. Bridging automatic speech recognition and psycholinguistics: Extending Shortlist to an end-to-end model of human speech recognition (L) [J] . Odette Scharenborg, Louis ten Bosch, Lou Boves, The Journal of the Acoustical Society of America . 2003,第6期

机译：桥接自动语音识别和心理语言学：将候选清单扩展到人类语音识别的端到端模型（L）
2. Representation transfer learning from deep end-to-end speech recognition networks for the classification of health states from speech [J] . Benjamin Sertolli, Zhao Ren, Bjoern W. Schuller, Computer speech and language . 2021,第Jula期

机译：从言语中，从深端到端语音识别网络中的代表转移学习
3. An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition [J] . Bo Wu, Kehuang Li, Fengpei Ge, Selected Topics in Signal Processing, IEEE Journal of . 2017,第8期

机译：端到端深度学习方法可同时进行语音去混响和声学建模，以实现可靠的语音识别
4. Dialog-Context Aware end-to-end Speech Recognition [C] . Suyoun Kim, Florian Metze Spoken Language Technology Workshop . 2018

机译：Dialog-Context意识到的端到端语音识别
5. End-to-End Speech Recognition on Conversations [D] . Kim, Suyoun . 2019

机译：对话的端到端语音识别
6. Dynamic Acoustic Unit Augmentation with BPE-Dropout for Low-Resource End-to-End Speech Recognition [O] . Aleksandr Laptev, Andrei Andrusenko, Ivan Podluzhny, 2021

机译：用BPE-ropout进行动态声学单元增强用于低资源端到端语音识别
7. Speaker-Aware Training of Attention-Based End-to-End Speech Recognition Using Neural Speaker Embeddings [O] . Aku Rouhe, Tuomas Kaseva, Mikko Kurimo 2020

机译：使用神经扬声器嵌入的扬声器感知注意力的关注结束语音识别

Dialog-Context Aware end-to-end Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅