首页> 外文会议>INTERSPEECH 2012 >The log-Gabor method: speech classification using spectrogram image analysis
【24h】

The log-Gabor method: speech classification using spectrogram image analysis

机译:log-gabor方法:使用频谱图图像分析语音分类

获取原文

摘要

We explored the suitability of the log-Gabor method, a speech analysis method inspired by Ezzat e.a. (2007), for automatic classification of personality and likability traits in speech. The core idea underlying the log-Gabor method is to treat the spectrogram as an image of spectro-temporal information. The image is transformed into Gabor energy values using the two-dimensional logarithmic Gabor transform, which is a standard feature extraction method in visual, texture analysis. The aggregated energy values are mapped onto classes by means of a support vector machine (SVM). The log-Gabor method performed above baseline on the INTERSPEECH Personality and Likability Sub-Challenges Development sets and comparable to baseline for the Test sets. These results support further investigation of the- log-Gabor method as a method for extracting perceptual cues from speech.
机译:我们探讨了Log-Gabor方法的适用性,一种由Ezzat E.A启发的语音分析方法。 (2007),用于自动分类个性和言论中的可爱性状。逻辑Gabor方法的核心思想是将频谱图视为光谱 - 时间信息的图像。使用二维对数Gabor变换将图像转换为Gabor能量值,这是视觉,纹理分析中的标准特征提取方法。通过支持向量机(SVM)将聚合的能量值映射到类上。在基线上高于基线的Log-Gabor方法,与测试集的基线相当的基线。这些结果支持进一步调查,作为从语音中提取感知提示的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号