首页> 外文会议>First grand challenge and workshop on human multimodal language 2018 >ASR-based Features for Emotion Recognition: A Transfer Learning Approach
【24h】

ASR-based Features for Emotion Recognition: A Transfer Learning Approach

机译:基于ASR的情绪识别功能:一种转移学习方法

获取原文
获取原文并翻译 | 示例

摘要

During the last decade, the applications of signal processing have drastically improved with deep learning. However areas of affecting computing such as emotional speech synthesis or emotion recognition from spoken language remains challenging. In this paper, we investigate the use of a neural Automatic Speech Recognition (ASR) as a feature extractor for emotion recognition. We show that these features outperform the eGeMAPS feature set to predict the valence and arousal emotional dimensions, which means that the audio-to-text mapping learned by the ASR system contains information related to the emotional dimensions in spontaneous speech. We also examine the relationship between first layers (closer to speech) and last layers (closer to text) of the ASR and valence/arousal.
机译:在过去的十年中,深度学习极大地改善了信号处理的应用。然而,影响计算的领域,例如情感语音合成或来自口头语言的情感识别仍然具有挑战性。在本文中,我们研究了使用神经自动语音识别(ASR)作为情感识别的特征提取器。我们表明,这些功能优于eGeMAPS功能集,可预测化合价和唤醒情绪维度,这意味着ASR系统学习的音频到文本映射包含与自发语音中情绪维度相关的信息。我们还检查了ASR的第一层(更靠近语音)和最后一层(最靠近文本)与价位/配音之间的关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号