Emotional Speech Recognition Based on Weighted Distance Optimization System

ElBedwehy Mona Nagy; Behery G. M.; Elbarougy Reda

首页> 外文期刊>International Journal of Pattern Recognition and Artificial Intelligence >Emotional Speech Recognition Based on Weighted Distance Optimization System

【24h】

Emotional Speech Recognition Based on Weighted Distance Optimization System

机译：基于加权距离优化系统的情绪语音识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Human emotion plays a major role in expressing their feelings through speech. Emotional speech recognition is an important research field in the human-computer interaction. Ultimately, the endowing machines that perceive the users' emotions will enable a more intuitive and reliable interaction.The researchers presented many models to recognize the human emotion from the speech. One of the famous models is the Gaussian mixture model (GMM). Nevertheless, GMM may sometimes have one or more of its components as ill-conditioned or singular covariance matrices when the number of features is high and some features are correlated. In this research, a new system based on a weighted distance optimization (WDO) has been developed for recognizing the emotional speech. The main purpose of the WDO system (WDOS) is to address the GMM shortcomings and increase the recognition accuracy. We found that WDOS has achieved considerable success through a comparative study of all emotional states and the individual emotional state characteristics. WDOS has a superior performance accuracy of 86.03% for the Japanese language. It improves the Japanese emotion recognition accuracy by 18.43% compared with GMM and k-mean.

机译：人类的情感在通过演讲表达自己的感受方面发挥着重要作用。情绪语音识别是人机互动中的重要研究领域。最终，感知用户情绪的遗传机器将实现更直观和可靠的互动。研究人员提出了许多模型来认识到讲话的人类情感。其中一个着名的模型是高斯混合模型（GMM）。然而，当特征数量高并且一些特征相关时，GMM可能有一个或多个组件作为不良协方差矩阵。在本研究中，已经开发了一种基于加权距离优化（WDO）的新系统，用于识别情绪语音。 WDO系统（WDOS）的主要目的是解决GMM缺点并提高识别准确性。我们发现WDO通过对所有情绪状态和个人情绪状态特征的比较研究取得了相当大的成功。 WDOS的性能准确性优于86.03％。它与GMM和K均值相比，它将日本情感识别准确性提高了18.43％。

著录项

来源
《International Journal of Pattern Recognition and Artificial Intelligence》 |2020年第11期|2050027.1-2050027.20|共20页
作者
ElBedwehy Mona Nagy; Behery G. M.; Elbarougy Reda;
展开▼
作者单位

Damietta Univ Math Dept Fac Sci Dumyat 34511 Egypt;

Damietta Univ Math Dept Fac Sci Dumyat 34511 Egypt;

Damietta Univ Math Dept Fac Sci Dumyat 34511 Egypt;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Gaussian mixture model; Emotional speech recognition; weighted distance; clustering; expectation maximization;

机译：高斯混合模型;情绪语音识别;加权距离;聚类;期望最大化;

相似文献

外文文献
中文文献
专利

1. A distance weighted linear regression classifier based on optimized distance calculating approach for face recognition [J] . Tang Linlin, Lu Huifen, Pang Zhen, Multimedia Tools and Applications . 2019,第22期

机译：基于优化距离计算方法的距离加权线性回归分类器
2. An improved maximum model distance approach for HMM-based speech recognition systems [J] . He QH., Man KF., Tang KS., Pattern Recognition: The Journal of the Pattern Recognition Society . 2000,第10期

机译：基于HMM的语音识别系统的改进的最大模型距离方法
3. A combined cepstral distance method for emotional speech recognition [J] . Quan Changqin, Zhang Bin, Sun Xiao, International Journal of Advanced Robotic Systems . 2017,第4期

机译：情绪语音识别的组合临床距离方法
4. Weighted Feature Fusion Based Emotional Recognition for Variable-length Speech using DNN [C] . Sifan Wu, Fei Li, Pengyuan Zhang International Wireless Communications and Mobile Computing Conference . 2019

机译：基于加权特征融合的DNN变长语音情感识别
5. Speech Based Machine Learning Models for Emotional State Recognition and PTSD Detection [D] . Banerjee, Debrup. 2017

机译：基于语音的机器学习模型用于情绪状态识别和PTSD检测
6. Jaccard distance based weighted sparse representation for coarse-to-fine plant species recognition [O] . Shanwen Zhang, Xiaowei Wu, Zhuhong You -1

机译：基于雅卡德距离的加权稀疏表示用于从粗到细的植物物种识别
7. A Log-Index Weighted Cepstral Distance Measure for Speech Recognition [O] . Zheng Fang (郑方, Wu Wenhu (吴文虎, Fang Ditang (方棣棠 2015

机译：用于语音识别的对数加权倒谱距离测度

Emotional Speech Recognition Based on Weighted Distance Optimization System

摘要

著录项

相似文献

相关主题

期刊订阅