首页> 外文OA文献 >Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition
【2h】

Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition

机译:使用由MLLR转换生成的伪扬声器特征进行声学模型训练,以实现与扬声器无关的可靠语音识别

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

A novel speech feature generation-based acoustic model training method for robust speaker-independent speech recognition is proposed. For decades, speaker adaptation methods have been widely used. All of these adaptation methods need adaptation data. However, our proposed method aims to create speaker-independent acoustic models that cover not only known but also unknown speakers. We achieve this by adopting inverse maximum likelihood linear regression (MLLR) transformation-based feature generation, and then we train our models using these features. First we obtain MLLR transformation matrices from a limited number of existing speakers. Then we extract the bases of the MLLR transformation matrices using PCA. The distribution of the weight parameters to express the transformation matrices for the existing speakers are estimated. Next, we construct pseudo-speaker transformations by sampling the weight parameters from the distribution, and apply the transformation to the normalized features of the existing speaker to generate the features of the pseudo-speakers. Finally, using these features, we train the acoustic models. Evaluation results show that the acoustic models trained using our proposed method are robust for unknown speakers.
机译:提出了一种新颖的基于语音特征生成的声学模型训练方法,用于鲁棒的独立于说话人的语音识别。几十年来,说话人适应方法已被广泛使用。所有这些适配方法都需要适配数据。但是,我们提出的方法旨在创建独立于扬声器的声学模型,该模型不仅覆盖已知扬声器,而且还覆盖未知扬声器。我们通过采用基于逆最大似然线性回归(MLLR)变换的特征生成来实现此目的,然后使用这些特征训练模型。首先,我们从数量有限的现有说话人那里获得MLLR变换矩阵。然后,我们使用PCA提取MLLR变换矩阵的基数。估计表示现有说话者的变换矩阵的权重参数的分布。接下来,我们通过从分布中采样权重参数来构造伪扬声器变换,并将该变换应用于现有扬声器的归一化特征以生成伪扬声器的特征。最后,使用这些功能,我们可以训练声学模型。评估结果表明,使用我们提出的方法训练的声学模型对于未知的说话人具有鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号