首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >Transient-based speech transmission index for predicting intelligibility in nonlinear speech enhancement processors
【24h】

Transient-based speech transmission index for predicting intelligibility in nonlinear speech enhancement processors

机译:基于瞬态的语音传输指数,用于预测非线性语音增强处理器中的清晰度

获取原文

摘要

A new speech intelligibility metric is proposed for the assessment of speech enhancement processors. These processors usually affect the fine structure in speech that is of fundamental importance to speech intelligibility. Classical metrics analyze the entire signal and thereby generally overestimate intelligibility. The measure presented here, therefore, isolates speech-transients by a cepstral smoothing technique and subsequently calculates speech intelligibility using an efficient version of the speech transmission index. By means of a genetic optimization of adjustable parameters, the proposed transition-based speech transmission index (TB STI) is adapted to the subjective data of linearly and nonlinearly processed speech. The method was assessed on untrained subjective data and showed a considerable improvement over other well-established measures.
机译:提出了一种新的语音清晰度指标,用于评估语音增强处理器。这些处理器通常会影响语音的精细结构,这对于语音可懂度至关重要。经典指标会分析整个信号,因此通常会高估清晰度。因此,此处介绍的措施通过倒谱平滑技术隔离了语音瞬态,然后使用语音传输索引的有效版本来计算语音清晰度。通过可调参数的遗传优化,所提出的基于过渡的语音传输指数(TB STI)适用于线性和非线性处理的语音的主观数据。该方法是在未经训练的主观数据上进行评估的,与其他公认的方法相比,显示出了很大的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号