Improving the performance of the speaker emotion recognition based on low dimension prosody features vector

Ashishkumar Prabhakar Gudmalwar; Ch V Rama Rao; Anirban Dutta

Abstract

Speaker emotion recognition is an important research problem with many applications in human-robot and human-computer interaction. This work addresses recognition of a speaker's emotion from a speech utterance. The features used are pitch, log energy, zero-crossing rate, and the first three formant frequencies. Feature vectors are constructed from 11 statistical parameters of each feature. An artificial neural network (ANN) is chosen as the classifier owing to its universal function approximation capability. In an ANN-based classifier, both the training time and the classification time depend on the dimension of the feature vector. This work therefore focuses on developing a speaker emotion recognition system based on prosody features and on reducing the dimensionality of the feature vectors, with principal component analysis (PCA) used for the reduction. The emotional prosody speech and transcription corpus from the Linguistic Data Consortium (LDC) and the Berlin emotional database are used to evaluate the proposed approach on recognition of seven emotion types. Compared with existing approaches, the proposed method performs better: the experiments give recognition rates of 75.32% on the Berlin emotional database and 84.5% on the LDC emotional speech database.
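
The pipeline described in the abstract (six frame-level features, 11 statistics each, then PCA) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not name the 11 statistics, so the set in `eleven_stats` is hypothetical, the contours are synthetic random numbers, and PCA is done here with plain power iteration so the example needs only the standard library. The names `FEATURES`, `eleven_stats`, `utterance_vector`, and `pca_project` are made up for this sketch.

```python
import math
import random
import statistics

# The six frame-level features named in the abstract.
FEATURES = ["pitch", "log_energy", "zcr", "f1", "f2", "f3"]

def eleven_stats(contour):
    """11 summary statistics of one contour (a hypothetical choice of statistics)."""
    mean = statistics.fmean(contour)
    std = statistics.pstdev(contour)
    q1, med, q3 = statistics.quantiles(contour, n=4)
    lo, hi = min(contour), max(contour)
    if std > 0:
        z = [(x - mean) / std for x in contour]
        skew = statistics.fmean([v ** 3 for v in z])
        kurt = statistics.fmean([v ** 4 for v in z])
    else:
        skew = kurt = 0.0  # flat contour: higher moments are undefined, use 0
    return [mean, med, std, lo, hi, hi - lo, q1, q3, q3 - q1, skew, kurt]

def utterance_vector(contours):
    """Concatenate the statistics of all six features: 6 * 11 = 66 dimensions."""
    vec = []
    for name in FEATURES:
        vec.extend(eleven_stats(contours[name]))
    return vec

def pca_project(rows, k, iters=100):
    """Project rows onto k principal directions (power iteration with deflation)."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    cov = [[sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
           for i in range(d)]
    rng = random.Random(0)
    components = []
    for _ in range(k):
        v = [rng.gauss(0.0, 1.0) for _ in range(d)]
        norm = math.sqrt(sum(x * x for x in v))
        v = [x / norm for x in v]
        for _ in range(iters):
            w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm == 0.0:
                break  # no variance left in the deflated matrix
            v = [x / norm for x in w]
        cv = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        lam = sum(v[i] * cv[i] for i in range(d))  # variance along v
        components.append(v)
        # Deflate: remove the direction just found before searching for the next.
        cov = [[cov[i][j] - lam * v[i] * v[j] for j in range(d)] for i in range(d)]
    return [[sum(c[j] * comp[j] for j in range(d)) for comp in components]
            for c in centered]

# Synthetic stand-in data: 20 "utterances", 50 frames per contour.
data_rng = random.Random(1)
vectors = [
    utterance_vector({name: [data_rng.uniform(0.0, 1.0) for _ in range(50)]
                      for name in FEATURES})
    for _ in range(20)
]
reduced = pca_project(vectors, k=5)
print(len(vectors[0]), len(reduced[0]))  # prints "66 5"
```

The reduced vectors would then be passed to the classifier in place of the 66-dimensional ones. With real features the columns would normally be standardized before PCA, since pitch and formants (in Hz) otherwise dominate the covariance; the target dimension `k=5` is arbitrary here and is not taken from the paper.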

Bibliographic details

Source
International Journal of Speech Technology | 2019, No. 3 | pp. 521-531 | 11 pages
Authors
Ashishkumar Prabhakar Gudmalwar; Ch V Rama Rao; Anirban Dutta
Affiliation
National Institute of Technology Meghalaya, Shillong, Meghalaya, India
Original format
PDF
Language
English
Keywords
Prosody; PCA; Emotion recognition; Recognition rate

Similar literature

1. Speech emotion recognition using hybrid spectral-prosodic features of speech signal/glottal waveform, metaheuristic-based dimensionality reduction, and Gaussian elliptical basis function network classifier [J]. Daneshfar Fatemeh, Kabudian Seyed Jahanshah, Neekabadi Abbas. Applied Acoustics, 2020, issue Sepa
2. Performance improvement and analysis of speaker independent emotion recognition system using i-vectors [J]. Naga Padmaja Jagini, Rajeswar Rao R. Journal of Theoretical and Applied Information Technology, 2018, No. 13
3. Prosodic feature based text dependent speaker recognition using machine learning algorithms [J]. Sunil Agrawal, Shruti A.K., C. Rama Krishna. International Journal of Engineering Science and Technology, 2010, No. 10
4. Emotions in speech - experiments with prosody and quality features in speech for use in categorical and dimensional emotion recognition environments [C]. Borchert, M., Dusterhoft, . 2005
5. Prosodic feature recognition of the 'yes' response by the non-native speaker of English and its implications for ESL [D]. Brownworth, Barbara A. 1999
6. Improving the signal subtle feature extraction performance based on dual improved fractal box dimension eigenvectors [O]. Xiang Chen, Jingchao Li, Hui Han. 2018
7. Speaker dependent emotion recognition using prosodic supervectors [O]. López Moreno, Ignacio, Ortego Resa, Carlos, González-Rodríguez, Joaquín. 2009
