...
首页> 外文期刊>International Journal of Pattern Recognition and Artificial Intelligence >Emotional Speech Recognition Based on Weighted Distance Optimization System
【24h】

Emotional Speech Recognition Based on Weighted Distance Optimization System

机译:基于加权距离优化系统的情绪语音识别

获取原文
获取原文并翻译 | 示例
           

摘要

Human emotion plays a major role in expressing their feelings through speech. Emotional speech recognition is an important research field in the human-computer interaction. Ultimately, the endowing machines that perceive the users' emotions will enable a more intuitive and reliable interaction.The researchers presented many models to recognize the human emotion from the speech. One of the famous models is the Gaussian mixture model (GMM). Nevertheless, GMM may sometimes have one or more of its components as ill-conditioned or singular covariance matrices when the number of features is high and some features are correlated. In this research, a new system based on a weighted distance optimization (WDO) has been developed for recognizing the emotional speech. The main purpose of the WDO system (WDOS) is to address the GMM shortcomings and increase the recognition accuracy. We found that WDOS has achieved considerable success through a comparative study of all emotional states and the individual emotional state characteristics. WDOS has a superior performance accuracy of 86.03% for the Japanese language. It improves the Japanese emotion recognition accuracy by 18.43% compared with GMM and k-mean.
机译:人类的情感在通过演讲表达自己的感受方面发挥着重要作用。情绪语音识别是人机互动中的重要研究领域。最终,感知用户情绪的遗传机器将实现更直观和可靠的互动。研究人员提出了许多模型来认识到讲话的人类情感。其中一个着名的模型是高斯混合模型(GMM)。然而,当特征数量高并且一些特征相关时,GMM可能有一个或多个组件作为不良协方差矩阵。在本研究中,已经开发了一种基于加权距离优化(WDO)的新系统,用于识别情绪语音。 WDO系统(WDOS)的主要目的是解决GMM缺点并提高识别准确性。我们发现WDO通过对所有情绪状态和个人情绪状态特征的比较研究取得了相当大的成功。 WDOS的性能准确性优于86.03%。它与GMM和K均值相比,它将日本情感识别准确性提高了18.43%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号