...
首页> 外文期刊>PLoS One >Short-time speaker verification with different speaking style utterances
【24h】

Short-time speaker verification with different speaking style utterances

机译:用不同的口语风格话语验证短时间发言者验证

获取原文
           

摘要

In recent years, great progress has been made in the technical aspects of automatic speaker verification (ASV). However, the promotion of ASV technology is still a very challenging issue, because most technologies are still very sensitive to new, unknown and spoofing conditions. Most previous studies focused on extracting target speaker information from natural speech. This paper aims to design a new ASV corpus with multi-speaking styles and investigate the ASV robustness to these different speaking styles. We first release this corpus in the Zenodo website for public research, in which each speaker has several text-dependent and text-independent singing, humming and normal reading speech utterances. Then, we investigate the speaker discrimination of each speaking style in the feature space. Furthermore, the intra and inter-speaker variabilities in each different speaking style and cross-speaking styles are investigated in both text-dependent and text-independent ASV tasks. Conventional Gaussian Mixture Model (GMM), and the state-of-the-art x-vector are used to build ASV systems. Experimental results show that the voiceprint information in humming and singing speech are more distinguishable than that in normal reading speech for conventional ASV systems. Furthermore, we find that combing the three speaking styles can significantly improve the x-vector based ASV system, even when only limited gains are obtained by conventional GMM-based systems.
机译:近年来,在自动演讲者验证(ASV)的技术方面取得了很大进展。然而,促进ASV技术仍然是一个非常具有挑战性的问题,因为大多数技术对新的,未知和欺骗性的条件仍然非常敏感。最先前的研究专注于从自然语音中提取目标扬声器信息。本文旨在设计具有多口语风格的新型ASV语料库,并调查对这些不同讲话方式的ASV鲁棒性。我们首先在Zenodo网站上发布了这个语料库,用于公共研究,其中每个发言者都有几种依赖和独立于文本的歌唱,嗡嗡声和正常阅读语言的话语。然后,我们调查在特征空间中的每个说话方式的发言者歧视。此外,在文本相关和文本独立于文本的ASV任务中调查了每个不同的说话方式和跨讲样式的讲话器和讲话者的变量。传统的高斯混合模型(GMM)和最先进的X-向量用于构建ASV系统。实验结果表明,嗡嗡声和唱歌语音中的声音信息比传统ASV系统正常阅读语言更区别。此外,我们发现,即使只有在基于GMM的系统获得的只有有限的增益,也可以显着改善基于X-向量的ASV系统的梳理三种样式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号