首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Comparison of Methods for Emotion Dimensions Estimation in Speech Using a Three-Layered Model
【24h】

Comparison of Methods for Emotion Dimensions Estimation in Speech Using a Three-Layered Model

机译:三层模型的语音情感维度估计方法的比较

获取原文
获取原文并翻译 | 示例
           

摘要

This paper proposes a three-layer model for estimating the expressed emotions in a speech signal based on a dimensional approach. Several estimators are adopted for estimating the three emotion dimensions (valence, activation, and dominance) for a speech signal. These estimators were designed to predict emotion dimensions from acoustic features directly. However, the acoustic features correlates to valence dimension are less numerous, less strong, and the valence dimension has being particularly difficult to be predicted. The ultimate goal of this study is to improve the dimensional approach in order to precisely predict the valence dimension. The proposed model consists of three layers: acoustic features, semantic primitives, emotion dimensions respectively. In this paper, we first compared several popular estimation methods and evaluated their performance by applying them using the traditional two-layered model and the proposed three-layered model. The experimental results show that the proposed three-layered model using fuzzy inference system and KNN as an estimator outperforms the traditional two-layered model using the same estimators.
机译:本文提出了一种基于维度方法的三层模型,用于估计语音信号中表达的情绪。采用几种估计器来估计语音信号的三个情感维度(价,激活和支配)。这些估计器旨在直接根据声学特征预测情绪维度。然而,与化合价相关的声学特征数量少,强度低,并且化合价很难被预测。这项研究的最终目标是改进尺寸方法,以便精确预测价价尺寸。所提出的模型包括三层:声学特征,语义原语,情感维度。在本文中,我们首先比较了几种流行的估计方法,并通过使用传统的两层模型和建议的三层模型将其应用来评估它们的性能。实验结果表明,提出的以模糊推理系统和KNN作为估计量的三层模型优于使用相同估计量的传统两层模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号