首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Exploring Attention Mechanism for Acoustic-based Classification of Speech Utterances into System-directed and Non-system-directed
【24h】

Exploring Attention Mechanism for Acoustic-based Classification of Speech Utterances into System-directed and Non-system-directed

机译:探索基于语音的语音言语分为系统指向性和非系统指向性的注意机制

获取原文

摘要

Voice controlled virtual assistants (VAs) are now available in smartphones, cars, and standalone devices in homes. In most cases, the user needs to first "wake-up" the VA by saying a particular word/phrase every time he/she wants the VA to do something. Eliminating the need for saying the wake-up word for every interaction could improve the user experience. This would require the VA to have the capability of understanding whether the user is talking to it or not. In other words, the challenge is to distinguish between system-directed and non-system-directed speech utterances. In this paper, we present a number of neural network architectures for tackling this classification problem based on using only the acoustic signal. It is shown that a model comprised of convolutional, recurrent, and feed-forward layers can achieve an equal error rate (EER) of below 20% for this task. In addition, we investigate the use of an attention mechanism for helping the model to focus on the more important parts of the signal and to improve handling of variable length inputs sequences. The results show that the proposed attention mechanism significantly improves the model accuracy achieving an EER of 16.25% and 15.62% on two distinct realistic datasets.
机译:语音控制虚拟助手(VA)现在可在智能手机,汽车和家庭中的独立设备中使用。在大多数情况下,用户每次想让VA做某事时,首先需要通过说出特定的单词/短语来“唤醒” VA。消除对每个交互都说唤醒词的需要,可以改善用户体验。这将要求VA具有了解用户是否在与之交谈的能力。换句话说,挑战在于区分系统定向的语音发声和非系统定向的语音发声。在本文中,我们提出了许多基于仅使用声音信号来解决此分类问题的神经网络体系结构。结果表明,由卷积层,递归层和前馈层组成的模型可以实现低于20%的等效错误率(EER)。此外,我们研究了注意力机制的使用,以帮助模型将注意力集中在信号的更重要部分上,并改善对可变长度输入序列的处理。结果表明,所提出的注意力机制显着提高了模型准确性,在两个不同的真实数据集上实现了16.25%和15.62%的EER。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号