首页> 外文期刊>Indian Journal of Science and Technology >Speech Emotion Recognition: Performance Analysis based on Fused Algorithms and GMM Modelling
【24h】

Speech Emotion Recognition: Performance Analysis based on Fused Algorithms and GMM Modelling

机译:语音情感识别:基于融合算法和GMM建模的性能分析

获取原文
           

摘要

Background/Objectives: Speech emotion recognition (SER) is an important aspect of Human-Computer Interaction systems which is widely used in different sectors like healthcare, robotics, automatic call centres and distance education. Speech emotion recognition involves in depth analysis of the signal and identifying the appropriate emotion based on its trained database using extracted features. Method/Statistical Analysis: This paper aims in devising SER system using linear prediction of the causal part of the autocorrelation sequence (OSALPC) algorithm which has been proven to efficiently reduce noise along with Linear Frequency Cepstral Coefficients (LFCC), Linear Predictive Coding (LPC), MFCC, LPC using cepstrum for feature extraction. After extracting the feature vectors from the voice signal, it is modelled using Gaussian Mixture Models (GMM). The MAP (Maximum a posteriori) rule is used for decision making. Findings: Performance was analysed and our proposed system showed an overall efficiency of 89% when tested on German database (Emo-DB) for 7 emotions. The overall efficiency has proven to increase compared to the studies made up to date on the German Database. The highest emotion recognition rate was for SAD using fused algorithm which was 95.56%. Also results were tabulated and compared using Modified MFCC. A Graphical unit interface of the proposed system is also devised. Application/Improvements: The applications of speech emotion recognition are farfetched. Further scope of this work will be a comparison of the achieved recognition rate using algorithms with recognition rate achieved by humans.
机译:背景/目的:语音情感识别(SER)是人机交互系统的重要方面,已广泛应用于医疗,机器人,自动呼叫中心和远程教育等不同领域。语音情感识别涉及对信号的深度分析,并使用提取的特征基于其经过训练的数据库来识别适当的情感。方法/统计分析:本文旨在利用自相关序列(OSALPC)算法的因果部分的线性预测设计SER系统,该算法已被证明可有效降低噪声,并具有线性频率倒谱系数(LFCC),线性预测编码(LPC) ),MFCC,使用倒谱的LPC进行特征提取。从语音信号中提取特征向量后,使用高斯混合模型(GMM)对其进行建模。 MAP(最大后验)规则用于决策。结果:对性能进行了分析,当在德国数据库(Emo-DB)上测试7种情绪时,我们提出的系统显示出89%的整体效率。与迄今为止在德国数据库上进行的研究相比,已证明总体效率有所提高。融合算法对SAD的情感识别率最高,为95.56%。还用改进的MFCC将结果制成表格并进行比较。还设计了所提出系统的图形单元界面。应用程序/改进:语音情感识别的应用程序牵强。这项工作的进一步范围将是使用算法将获得的识别率与人类获得的识别率进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号