Conference: International Conference on Communication and Electronics Systems

A comparative study of silence and non silence regions of speech signal using prosody features


Abstract

The objective of the present work is to develop an emotion-based speech recognition system using knowledge of speech statistics and prosodic features. A detailed study of the silence and non-silence regions of a speech signal helps to increase the accuracy of the system. This paper takes forward the idea of removing silence regions from the speech signal using a predefined threshold. Prosodic features deal with the auditory qualities of sound and can also reflect aspects of the meaning, intention, and emotional state of the speaker. Emotion-bearing features such as energy, pitch, and duration are considered. These features are classified using a conventional Gaussian mixture model (GMM). The performance of the system is evaluated on the Berlin emotion speech corpus. The accuracy of the system increased from 21% to 36%.
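The abstract does not say how the predefined threshold is applied, so the following is only a minimal sketch of one common approach: short-time energy thresholding over non-overlapping frames. The function name `remove_silence`, the frame length of 400 samples, and the relative threshold of 5% of the peak frame energy are all illustrative assumptions, not values taken from the paper.

```python
import numpy as np


def remove_silence(signal, frame_len=400, rel_threshold=0.05):
    """Keep only frames whose mean energy reaches a fixed fraction of the peak.

    Hypothetical helper: frame_len and rel_threshold are assumed values,
    not parameters reported in the paper.
    Returns the concatenated non-silence samples and a boolean mask per frame.
    """
    signal = np.asarray(signal, dtype=float)
    n_frames = len(signal) // frame_len
    if n_frames == 0:
        return signal[:0], np.zeros(0, dtype=bool)

    # Split into non-overlapping frames; trailing samples that do not
    # fill a whole frame are discarded.
    frames = signal[: n_frames * frame_len].reshape(n_frames, frame_len)

    # Short-time energy: mean squared amplitude of each frame.
    energy = np.mean(frames ** 2, axis=1)
    if energy.max() == 0.0:
        # Entirely silent input: nothing to keep.
        return signal[:0], np.zeros(n_frames, dtype=bool)

    # A frame counts as non-silence when its energy reaches the threshold.
    keep = energy >= rel_threshold * energy.max()
    return frames[keep].reshape(-1), keep


# Usage on a synthetic signal: silence, a 200 Hz tone, then silence again.
sample_rate = 16000
t = np.arange(sample_rate // 2) / sample_rate
tone = np.sin(2 * np.pi * 200 * t)
quiet = np.zeros(sample_rate // 2)
voiced, mask = remove_silence(np.concatenate([quiet, tone, quiet]))
```

Features such as energy, pitch, and duration would then be computed on `voiced` rather than on the full recording. A threshold relative to the peak frame energy is used here so the sketch does not depend on the recording level; an absolute threshold would work equally well if the input is normalized.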
