首页> 外文OA文献 >Improving short utterance based I-vector speaker recognition using source and utterance-duration normalization techniques
【2h】

Improving short utterance based I-vector speaker recognition using source and utterance-duration normalization techniques

机译:使用源和话语持续时间归一化技术改进基于短话语的I矢量说话人识别

摘要

A significant amount of speech is typically required for speaker verification system development and evaluation, especially in the presence of large intersession variability. This paper introduces a source and utterance duration normalized linear discriminant analysis (SUN-LDA) approaches to compensate session variability in short-utterance i-vector speaker verification systems. Two variations of SUN-LDA are proposed where normalization techniques are used to capture source variation from both short and full-length development i-vectors, one based upon pooling (SUN-LDA-pooled) and the other on concatenation (SUN-LDA-concat) across the duration and source-dependent session variation. Both the SUN-LDA-pooled and SUN-LDA-concat techniques are shown to provide improvement over traditional LDA on NIST 08 truncated 10sec-10sec evaluation conditions, with the highest improvement obtained with the SUN-LDA-concat technique achieving a relative improvement of 8% in EER for mis-matched conditions and over 3% for matched conditions over traditional LDA approaches.
机译:说话者验证系统的开发和评估通常需要大量语音,尤其是在会话间存在较大差异的情况下。本文介绍了一种源和话音持续时间归一化线性判别分析(SUN-LDA)方法,以补偿短话i向量说话者验证系统中的会话可变性。提出了SUN-LDA的两种变体,其中使用规范化技术从短和全长开发i向量中捕获源变异,一种基于合并(SUN-LDA池化),另一种基于串联(SUN-LDA-持续时间和与来源相关的会话变化)。已显示SUN-LDA合并技术和SUN-LDA concat技术在NIST 08截断10sec-10sec评估条件下提供了优于传统LDA的改进,SUN-LDA concat技术获得的最高改进实现了相对改进。与传统的LDA方法相比,不匹配条件的EER为8%,匹配条件的为3%以上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号