Fast speaker recognition scoring using I-vector posteriors and probabilistic linear discriminant analysis

Abstract

A method for performing speaker recognition comprises: estimating respective uncertainties of acoustic coverage of respective speech utterance(s) by first and second speakers, the acoustic coverage representing respective sounds used by the speakers when speaking; representing the respective uncertainties of acoustic coverage in a manner that allows for efficient memory usage by discarding dependencies between uncertainties of different sounds for the speakers; representing the respective uncertainties of acoustic coverage in a manner that allows for efficient computation by representing an inverse of the respective uncertainties of acoustic coverage and then discarding the dependencies between the uncertainties of different sounds for the speakers; and computing a score between the speech utterance(s) by the speakers in a manner that leverages the respective uncertainties of the acoustic coverage during the comparison, the score being indicative of a likelihood that the speakers are the same speaker.
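
The abstract's steps can be illustrated with a minimal sketch. It is not the patent's claimed formula: it assumes a simplified two-covariance Gaussian model in which the between-speaker and within-speaker covariances are already diagonal, and it treats each utterance's "uncertainty of acoustic coverage" as a posterior covariance of its i-vector. All function and parameter names (`diag_of_covariance`, `diag_via_precision`, `llr_score`, `between`, `within`) are hypothetical and chosen only for illustration.

```python
import numpy as np


def diag_of_covariance(cov):
    """Memory-efficient form: keep only the per-dimension variances."""
    return np.diag(cov).copy()


def diag_via_precision(cov):
    """Invert the uncertainty first, then drop the off-diagonal terms.

    Returned on the variance scale (reciprocal of the diagonal precision)
    so it can be passed to llr_score. This is one reading of the abstract.
    """
    precision = np.linalg.inv(cov)
    return 1.0 / np.diag(precision)


def llr_score(x1, u1, x2, u2, between, within):
    """Log-likelihood ratio: same speaker versus different speakers.

    Assumed model, per dimension: x_i = y + e_i, with y ~ N(0, between)
    and e_i ~ N(0, within + u_i), where u_i is utterance i's diagonal
    uncertainty. All arguments are 1-D arrays of equal length.
    """
    t1 = between + within + u1  # total variance of utterance 1
    t2 = between + within + u2  # total variance of utterance 2
    # Determinant of the 2x2 joint covariance [[t1, b], [b, t2]].
    det = t1 * t2 - between * between
    log_det_term = np.log(det) - np.log(t1) - np.log(t2)
    quad_same = (t2 * x1**2 - 2.0 * between * x1 * x2 + t1 * x2**2) / det
    quad_diff = x1**2 / t1 + x2**2 / t2
    return float(-0.5 * np.sum(log_det_term + quad_same - quad_diff))


if __name__ == "__main__":
    between = np.array([1.0, 1.0])
    within = np.array([0.5, 0.5])
    full_cov = np.array([[0.2, 0.05], [0.05, 0.3]])  # toy uncertainty
    x = np.array([1.0, -2.0])
    u = diag_via_precision(full_cov)
    print(llr_score(x, u, x, u, between, within))
```

Because every quantity is diagonal, the score costs a handful of elementwise operations per dimension and needs no matrix inversion at comparison time, which is the kind of saving the abstract points to. As the uncertainties grow, the score in this sketch shrinks toward zero, so poorly covered utterances contribute weaker evidence either way.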
