首页> 外文期刊>International journal of speech technology >Classifying females' stressed and neutral voices using acoustic-phonetic analysis of vowels: an exploratory investigation with emergency calls
【24h】

Classifying females' stressed and neutral voices using acoustic-phonetic analysis of vowels: an exploratory investigation with emergency calls

机译:使用元音的语音分析对女性的压力和中性声音进行分类:带有紧急呼叫的探索性调查

获取原文
获取原文并翻译 | 示例
           

摘要

In the present exploratory study, we investigated acoustic-phonetic measures of spoken vowels for detection of female speech under conditions of stress. Eight authentic recorded calls to emergency services received from eight Finnish adult female speakers were chosen for the analysis. Based on the purpose of the call, the recordings were divided into two groups: the stressed group and the neutral group. We chose f0, H1-H2 and centre of gravity as acoustic-phonetic predictors for our final classification models; In previous studies, high fO has been associated with a stressed voice, but H1-H2 and centre of gravity have not previously been related to speech under stress. On the other hand, H1-H2 has been used to detect non-modal voice qualities, such as a creaky or breathy voice, and similar voice qualities have been observed in stressed speech. Furthermore, indications exist that in speech under stress, acoustic energy is concentrated in higher frequencies, which consequently increases the centre of gravity. We tested stress detection accuracy with three statistical classifiers: LDA, logistic regression and decision tree. Our results indicated that all the models performed better when they were trained using only the vowel I'll rather than training them with all Finnish vowels. The use of our best performing model, a logistic regression model based on /i/, yielded 88% correct classification, whereas a logistic regression model trained with all vowels achieved an accuracy of only 81 %. We conclude that the results indicate a good stress classification accuracy, although further research with more extensive data is required.
机译:在目前的探索性研究中,我们研究了在压力条件下检测女性语音的口语元音的声学测量方法。分析时选择了从八名芬兰成年女性发言人那里收到的八次真实记录的紧急服务电话。根据通话的目的,将录音分为两组:重音组和中立组。我们选择f0,H1-H2和重心作为最终分类模型的语音预测指标;在先前的研究中,高fO与压力的语音有关,但是H1-H2和重心以前与压力下的语音没有关系。另一方面,H1-H2已被用于检测非模态语音质量,例如吱吱作响的或呼吸的语音,并且在重音中也观察到了类似的语音质量。此外,有迹象表明,在压力下的语音中,声能集中在较高的频率上,因此增加了重心。我们使用三个统计分类器测试了压力检测的准确性:LDA,逻辑回归和决策树。我们的结果表明,仅使用I'll将使用元音训练而不是使用所有芬兰元音训练它们时,所有模型的性能都更好。使用性能最佳的模型(基于/ i /的逻辑回归模型)可以得出88%正确的分类,而使用所有元音训练的逻辑回归模型只能达到81%的准确度。我们得出的结论是,尽管需要对更广泛的数据进行进一步的研究,但结果表明应力分类的准确性较高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号