...
首页> 外文期刊>International Journal of Image, Graphics and Signal Processing >Improved Frame Level Features and SVM Supervectors Approach for The Recogniton of Emotional States from Speech: Application to Categorical and Dimensional States
【24h】

Improved Frame Level Features and SVM Supervectors Approach for The Recogniton of Emotional States from Speech: Application to Categorical and Dimensional States

机译:改进的帧级特征和SVM超向量方法,用于从语音中识别情绪状态:应用于分类和维度状态

获取原文
           

摘要

The purpose of speech emotion recognition system is to classify speaker's utterances into different emotional states such as disgust, boredom, sadness, neutral and happiness. Speech features that are commonly used in speech emotion recognition (SER) rely on global utterance level prosodic features. In our work, we evaluate the impact of frame-level feature extraction. The speech samples are from Berlin emotional database and the features extracted from these utterances are energy, different variant of mel frequency cepstrum coefficients (MFCC), velocity and acceleration features. The idea is to explore the successful approach in the literature of speaker recognition GMM-UBM to handle with emotion identification tasks. In addition, we propose a classi?cation scheme for the labeling of emotions on a continuous dimensional-based approach.
机译:语音情感识别系统的目的是将说话者的话语分为厌恶,无聊,悲伤,中立和幸福等不同的情感状态。语音情感识别(SER)中常用的语音功能依赖于整体话语水平的韵律功能。在我们的工作中,我们评估帧级特征提取的影响。语音样本来自柏林情感数据库,从这些语音中提取的特征是能量,梅尔频率倒谱系数(MFCC)的不同变体,速度和加速度特征。这个想法是探索说话人识别GMM-UBM文献中成功处理情感识别任务的成功方法。此外,我们提出了一种基于连续维的方法来标记情感的分类方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号