首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >The Feature Extraction Based on Texture Image Information for Emotion Sensing in Speech
【2h】

The Feature Extraction Based on Texture Image Information for Emotion Sensing in Speech

机译:基于纹理图像信息的语音情感特征提取

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In this paper, we present a novel texture image feature for Emotion Sensing in Speech (ESS). This idea is based on the fact that the texture images carry emotion-related information. The feature extraction is derived from time-frequency representation of spectrogram images. First, we transform the spectrogram as a recognizable image. Next, we use a cubic curve to enhance the image contrast. Then, the texture image information (TII) derived from the spectrogram image can be extracted by using Laws' masks to characterize emotional state. In order to evaluate the effectiveness of the proposed emotion recognition in different languages, we use two open emotional databases including the Berlin Emotional Speech Database (EMO-DB) and eNTERFACE corpus and one self-recorded database (KHUSC-EmoDB), to evaluate the performance cross-corpora. The results of the proposed ESS system are presented using support vector machine (SVM) as a classifier. Experimental results show that the proposed TII-based feature extraction inspired by visual perception can provide significant classification for ESS systems. The two-dimensional (2-D) TII feature can provide the discrimination between different emotions in visual expressions except for the conveyance pitch and formant tracks. In addition, the de-noising in 2-D images can be more easily completed than de-noising in 1-D speech.
机译:在本文中,我们提出了一种用于语音情感感知(ESS)的新颖纹理图像功能。这个想法是基于这样的事实,即纹理图像带有与情感有关的信息。特征提取是从频谱图图像的时频表示中得出的。首先,我们将频谱图转换为可识别的图像。接下来,我们使用三次曲线来增强图像对比度。然后,可以通过使用Laws的蒙版来刻画情感状态来提取从频谱图图像中获取的纹理图像信息(TII)。为了评估提议的情绪识别在不同语言中的有效性,我们使用了两个开放的情绪数据库,包括柏林情绪语音数据库(EMO-DB)和eNTERFACE语料库,以及一个自我记录数据库(KHUSC-EmoDB),以评估跨公司绩效。使用支持向量机(SVM)作为分类器,提出了所提出的ESS系统的结果。实验结果表明,基于视觉感知的基于TII的特征提取可以为ESS系统提供重要的分类。二维(2-D)TII功能可区分视觉表情中不同的情绪,除了传送音高和共振峰轨迹。另外,与一维语音中的降噪相比,可以更轻松地完成二维图像中的降噪。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号