【24h】

STATISTICAL MODEL MIGRATION IN SPEAKER RECOGNITION

机译:扬声器识别中的统计模型迁移

获取原文
获取原文并翻译 | 示例

摘要

In large-scale deployments of speaker recognition systems the potential for legacy problems increases as the evolving technology may require configuration changes in the system thus invalidating already existing user voice accounts. Unless the entire database of original speech waveform were stored, users need to reenroll to keep their accounts functional, which, however, may be expensive and commercially not acceptable. We define model migration as a conversion of obsolete models to new-configuration models without additional data and waveform requirements and investigate ways to achieve such a migration with minimum loss of system accuracy. As a proof-of-concept, an algorithm for statistical migration in the Maximum A-Posteriori framework is studied and evaluated experimentally using the NIST SRE-03 dataset. The migration step is discussed in a wider conceptual framework of Conversational Biometrics.
机译:在说话者识别系统的大规模部署中,随着不断发展的技术可能要求更改系统中的配置,遗留问题的可能性增加,从而使已经存在的用户语音帐户无效。除非存储了原始语音波形的整个数据库,否则用户需要重新注册才能保持其帐户功能正常,但是这可能很昂贵,并且在商业上不可接受。我们将模型迁移定义为在没有其他数据和波形要求的情况下将过时的模型转换为新配置的模型,并研究以最小的系统精度损失实现这种迁移的方法。作为概念验证,使用NIST SRE-03数据集研究和评估了在Maximum A-Posteriori框架中进行统计迁移的算法。在会话生物识别的更广泛概念框架中讨论了迁移步骤。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号