首页> 外文期刊>IEICE Transactions on Information and Systems >Fundamental Frequency Estimation for Noisy Speech Using Entropy-Weighted Periodic and Harmonic Features
【24h】

Fundamental Frequency Estimation for Noisy Speech Using Entropy-Weighted Periodic and Harmonic Features

机译:基于熵加权的周期和谐波特征的嘈杂语音基本频率估计

获取原文
获取原文并翻译 | 示例
           

摘要

This paper proposes a robust method for estimating the fundamental frequency (F0) in real environments. It is assumed that the spectral structure of real environmental noise varies momentarily and its energy does not distribute evenly in the time-frequency domain. Therefore, segmenting a spectrogram of speech mixed with environmental noise into narrow time-frequency regions will produce low-noise regions in which the signal-to-noise ratio is high. The proposed method estimates F0 from the periodic and harmonic features that are clearly observed in the low-noise regions. It first uses two kinds of spectrogram, one with high frequency resolution and another with high temporal resolution, to represent the periodic and harmonic features corresponding to F0. Next, the method segments these two kinds of feature plane into narrow time-frequency regions, and calculates the probability function of F0 for each region. It then utilizes the entropy of the probability function as weight to emphasize the probability function in the low-noise region and to enhance noise robustness. Finally, the probability functions are grouped in each time, and F0 is obtained as the frequency with the highest probability of the function. The experimental results showed that, in comparison with other approaches such as the cepstrum method and the autocorrelation method, the developed method can more robustly estimate F0s from speech in the presence of band-limited noise and car noise.
机译:本文提出了一种在实际环境中估算基频(F0)的鲁棒方法。假定实际环境噪声的频谱结构会瞬时变化,并且其能量在时频域中分布不均匀。因此,将与环境噪声混合的语音频谱图分割成狭窄的时频区域将产生低信噪比的低噪声区域。所提出的方法根据在低噪声区域中清晰观察到的周期性和谐波特征来估计F0。它首先使用两种频谱图,一种具有高频率分辨率,另一种具有高时间分辨率,来表示与F0对应的周期和谐波特征。接下来,该方法将这两种特征平面划分为狭窄的时频区域,并为每个区域计算F0的概率函数。然后,它利用概率函数的熵作为权重,以强调低噪声区域中的概率函数并增强噪声的鲁棒性。最后,每次对概率函数进行分组,并获得F0作为函数概率最高的频率。实验结果表明,与倒谱法和自相关法等其他方法相比,该方法在存在带限噪声和汽车噪声的情况下,可以更加可靠地从语音中估计F0。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号