INTERSPEECH 2012

A Non-Uniform Filterbank for Speaker Recognition


Abstract

It is known that speaker-specific information is distributed non-uniformly in the frequency domain. Current speaker recognition systems use auditory-motivated scales to extract acoustic features. These scales, however, are not optimised to exploit the spectral distribution of speaker-specific information and hence may not be the best choice for speaker recognition. In this paper, the authors study the distribution of speaker-specific information for the Spectral Centroid Frequency feature and propose a non-uniform filterbank that captures this information effectively for that feature. The F-ratio and the Kullback-Leibler (KL) distance were used to measure the distribution of speaker-specific information, and it was shown empirically that the KL distance is better than the F-ratio at measuring discriminative ability. The proposed filterbank emphasises the regions of high KL distance by allocating more filters to those regions. Experimental results showed a relative EER reduction of 8.8% over the Mel-scale filterbank on the NIST 2006 SRE database.
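
The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the F-ratio or the KL distance were computed, nor how the filters were placed. The sketch assumes a per-band F-ratio defined as between-speaker variance over mean within-speaker variance, a symmetric KL distance between one-dimensional Gaussians, and a placement rule that gives every filter an equal share of the cumulative discriminability score. All function names (`f_ratio`, `symmetric_kl_gauss`, `allocate_edges`) are hypothetical.

```python
import numpy as np


def f_ratio(features, labels):
    """Per-band F-ratio (assumed definition): variance of the speaker means
    divided by the mean within-speaker variance.
    features: (n_frames, n_bands), labels: (n_frames,) speaker ids."""
    speakers = np.unique(labels)
    means = np.stack([features[labels == s].mean(axis=0) for s in speakers])
    within = np.stack([features[labels == s].var(axis=0) for s in speakers])
    return means.var(axis=0) / within.mean(axis=0)


def symmetric_kl_gauss(mu1, var1, mu2, var2):
    """Symmetric KL distance KL(p||q) + KL(q||p) between two 1-D Gaussians
    (assumes each speaker's band feature is modelled as a Gaussian)."""
    d2 = (mu1 - mu2) ** 2
    return 0.5 * (var1 / var2 + var2 / var1 - 2.0) + 0.5 * d2 * (1.0 / var1 + 1.0 / var2)


def allocate_edges(score, band_edges_hz, n_filters):
    """Return n_filters + 2 edge frequencies such that each interval between
    consecutive edges covers an equal share of the cumulative score.
    score: (n_bands,) non-negative discriminability per analysis band.
    band_edges_hz: (n_bands + 1,) increasing edges of those bands."""
    score = np.asarray(score, dtype=float) + 1e-12  # keep the CDF strictly increasing
    cdf = np.concatenate([[0.0], np.cumsum(score / score.sum())])
    targets = np.linspace(0.0, 1.0, n_filters + 2)
    # Invert the piecewise-linear CDF: equal steps in score become
    # short steps in frequency where the score is high.
    return np.interp(targets, cdf, band_edges_hz)


# Toy example: the 2-3 kHz band is assumed to be the most discriminative.
toy_score = np.array([1.0, 1.0, 4.0, 1.0])
toy_band_edges = np.array([0.0, 1000.0, 2000.0, 3000.0, 4000.0])
toy_edges = allocate_edges(toy_score, toy_band_edges, n_filters=5)
```

In the toy example most of the returned edges fall between 2 and 3 kHz, so the filters built on them (for instance, triangular filters spanning three consecutive edges) are narrower and denser in the band with the highest score. Substituting a KL-based score for `toy_score` would follow the abstract's preference for the KL distance over the F-ratio.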
