首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Speaker verification with the mixture of Gaussian factor analysis based representation
【24h】

Speaker verification with the mixture of Gaussian factor analysis based representation

机译:结合基于高斯因子分析的表示进行说话人验证

获取原文

摘要

This paper presents a generalized i-vector representation framework using the mixture of Gaussian (MoG) factor analysis for speaker verification. Conventionally, a single standard factor analysis is adopted to generate a low rank total variability subspace where the mean supervector is assumed to be Gaussian distributed. The energy that can't be represented by the low rank space is modeled by a single multivariate Gaussian. However, due to the sparsity of the frame level posterior probability and the short duration characteristics, some dimensions of the first-order statistics may not be Gaussian distributed. Therefore, we replace the single Gaussian with a mixture of Gaussians to better represent the residual energy. Experimental results on the NIST SRE 2010 condition 5 female task and the RSR 2015 part 1 female task show that the MoG i-vector outperforms the i-vector baseline by more than 10% relatively for both text independent and text dependent speaker verification tasks, respectively.
机译:本文提出了一种使用混合高斯(MoG)因子分析进行说话人验证的通用i向量表示框架。常规地,采用单个标准因子分析来生成低秩总可变性子空间,其中平均超向量被假定为高斯分布。低阶空间无法表示的能量由单个多元高斯模型建模。但是,由于帧级后验概率的稀疏性和短持续时间特性,一阶统计量的某些维度可能不是高斯分布的。因此,我们用混合高斯代替单个高斯,以更好地表示剩余能量。在NIST SRE 2010条件5女性任务和RSR 2015第1部分女性任务上的实验结果表明,对于独立于文本的说话者和依赖于文本的说话者验证任务,MoG i-vector相对于i-vector基线的性能分别高出10%以上。 。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号