
A study on speaker normalization using vocal tract normalization and speaker adaptive training



Abstract

Although speaker normalization is attempted in very different manners, vocal tract normalization (VTN) and speaker adaptive training (SAT) share many common properties. We show that both lead to more compact representations of the phonetically relevant variations of the training data and that both achieve improved error rate performance only if a complementary normalization or adaptation operation is conducted on the test data. Algorithms for fast test speaker enrolment are presented for both normalization methods: in the framework of SAT, a pre-transformation step is proposed, which alone, i.e. without subsequent unsupervised MLLR adaptation, reduces the error rate by almost 10% on the WSJ 5k test sets. For VTN, the use of a Gaussian mixture model makes obsolete a first recognition pass to obtain a preliminary transcription of the test utterance at hardly any loss in performance.
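The GMM-based warping-factor selection mentioned for VTN can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the warping function, the feature type, or the mixture size. The sketch assumes toy one-dimensional features (standing in for spectral peak frequencies in Hz), a purely linear warp, and a hand-set three-component mixture. The names `gmm_log_likelihood` and `select_warping_factor` are hypothetical. It only shows the general idea: score each candidate warping factor against a text-independent mixture model and keep the best one, so no preliminary transcription is required.

```python
import math


def gmm_log_likelihood(frames, weights, means, variances):
    """Total log-likelihood of 1-D frames under a Gaussian mixture model."""
    total = 0.0
    for x in frames:
        # Per-component log densities, combined with log-sum-exp for stability.
        logs = [
            math.log(w) - 0.5 * (math.log(2.0 * math.pi * v) + (x - m) ** 2 / v)
            for w, m, v in zip(weights, means, variances)
        ]
        top = max(logs)
        total += top + math.log(sum(math.exp(l - top) for l in logs))
    return total


def select_warping_factor(frames, alphas, weights, means, variances):
    """Grid search: return the candidate alpha whose warped frames score highest.

    Assumption: the warp is a plain scaling of each feature by alpha, and the
    Jacobian term of the transformation is omitted for simplicity.
    """
    best_alpha, best_score = None, -math.inf
    for alpha in alphas:
        warped = [alpha * x for x in frames]
        score = gmm_log_likelihood(warped, weights, means, variances)
        if score > best_score:
            best_alpha, best_score = alpha, score
    return best_alpha


# Hypothetical mixture representing normalized training data.
weights = [1.0 / 3.0] * 3
means = [500.0, 1500.0, 2500.0]
variances = [100.0 ** 2] * 3

# Toy test speaker whose frequencies are all lower by a factor of 1.10.
normalized = [500.0, 1500.0, 2500.0, 510.0, 1490.0, 2480.0]
test_frames = [x / 1.10 for x in normalized]

alphas = [0.90, 0.95, 1.00, 1.05, 1.10, 1.15]
best = select_warping_factor(test_frames, alphas, weights, means, variances)
print("selected warping factor:", best)
```

Because the mixture model is independent of the spoken text, the search runs directly on the test features. A real system would warp the frequency axis during feature extraction rather than scale finished features.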


Bibliographic details

Source: 1998, pp. 797-800 (4 pages)
Authors: Welling, L.; Haeb-Umbach, R.
Original format: PDF
Chinese Library Classification: Radio electronics, telecommunication technology

Similar literature

1. Speaker adaptive modeling by vocal tract normalization [J]. Welling L., Ney H., Kanthak S. IEEE Transactions on Speech and Audio Processing. 2002, no. 6
2. Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification [J]. Walid Hussein, Sarah Akram Essmat, Nestor Yoma. International Journal of Recent Contributions from Engineering, Science & IT. 2016, no. 4
3. A study on speaker normalization using vocal tract normalization and speaker adaptive training [C]. Welling L., Haeb-Umbach R. IEEE International Conference on Acoustics, Speech and Signal Processing. 1998
4. Frequency warping by linear transformation, and vocal tract inversion for speaker normalization in automatic speech recognition [D]. Panchapagesan, Sankaran. 2008
5. Revisiting vocal perception in non-human animals: a review of vowel discrimination speaker voice recognition and speaker normalization [O]. Buddhamas Kriengwatana, Paola Escudero, Carel ten Cate. 2014
6. A study on speaker normalization using vocal tract normalization and speaker adaptive training [O]. Welling Lutz, Haeb-Umbach R., Aubert Xavier L. 1998
