An Online Incremental Speaker Adaptation Method Using Speaker-Clustered Initial Models


Abstract

We previously proposed an incremental speaker adaptation method combined with automatic speaker-change detection for broadcast news transcription, where speakers change frequently and each of them utters a series of several sentences. In this method, the speaker change is detected using speaker-independent and speaker-adaptive Gaussian mixture models (GMMs). Both phone HMMs and GMMs are incrementally adapted to each speaker by the combination of MLLR, MAP and VFS methods, using speaker-independent (SI) models as initial models. This paper proposes an improvement in which the initial model for speaker adaptation is selected from a set of models made by speaker clustering. Either cluster-dependent phone HMMs or GMMs are used to calculate the likelihood for selecting the best initial model. In a broadcast news transcription task, the proposed method significantly reduces word error rate compared with the method using SI-HMM as an initial model. Online incremental speaker adaptation results show that word error rate is reduced by 11.6
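The likelihood-based selection of an initial model described in the abstract can be illustrated with a minimal sketch. It assumes each speaker cluster is represented by a diagonal-covariance GMM and that the cluster whose GMM gives the highest total log-likelihood on the incoming frames is chosen. The function names, the cluster names, and the toy parameters are hypothetical and are not taken from the paper, which may also use cluster-dependent phone HMMs for this step.

```python
import numpy as np


def gmm_log_likelihood(frames, weights, means, variances):
    """Total log-likelihood of frames (T, D) under a diagonal-covariance GMM.

    weights: (K,), means: (K, D), variances: (K, D).
    """
    diff = frames[:, None, :] - means[None, :, :]                      # (T, K, D)
    log_norm = -0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)  # (K,)
    log_comp = log_norm[None, :] - 0.5 * np.sum(
        diff ** 2 / variances[None, :, :], axis=2
    )                                                                  # (T, K)
    log_weighted = np.log(weights)[None, :] + log_comp
    # Log-sum-exp over mixture components, stabilised by the per-frame maximum.
    peak = log_weighted.max(axis=1, keepdims=True)
    frame_ll = peak[:, 0] + np.log(np.exp(log_weighted - peak).sum(axis=1))
    return float(frame_ll.sum())


def select_initial_model(frames, cluster_gmms):
    """Return the name of the best-scoring cluster and all cluster scores."""
    scores = {
        name: gmm_log_likelihood(frames, *params)
        for name, params in cluster_gmms.items()
    }
    best = max(scores, key=scores.get)
    return best, scores


# Toy example: two hypothetical clusters with single-component GMMs.
cluster_gmms = {
    "cluster_a": (np.array([1.0]), np.array([[0.0, 0.0]]), np.array([[1.0, 1.0]])),
    "cluster_b": (np.array([1.0]), np.array([[3.0, 3.0]]), np.array([[1.0, 1.0]])),
}
frames = np.array([[2.9, 3.1], [3.0, 3.0], [3.2, 2.8]])
best, scores = select_initial_model(frames, cluster_gmms)
print(best)
```

In this toy setup the frames lie close to the mean of `cluster_b`, so that cluster is selected. In a full system the selected cluster's models would then serve as the starting point for the incremental adaptation step instead of the SI models.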


Bibliographic details

Source
International Conference on Spoken Language Processing | 2000 | 4 pages
Authors
Zhipeng Zhang; Sadaoki Furui
Original format: PDF
Chinese Library Classification: G18

Similar literature

1. Bayesian Unsupervised Batch and Online Speaker Adaptation of Activation Function Parameters in Deep Models for Automatic Speech Recognition [J]. Zhen Huang, Sabato Marco Siniscalchi, Chin-Hui Lee. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2017, No. 1

2. Online Speaker Clustering Using Incremental Learning of an Ergodic Hidden Markov Model [J]. Takafumi KOSHINAKA, Kentaro NAGATOMO, Koichi SHINODA. IEICE Transactions on Information and Systems. 2012, No. 10

3. An Online Incremental Speaker Adaptation Method Using Speaker-Clustered Initial Models [C]. Zhipeng Zhang, Sadaoki Furui. 6th International Conference on Spoken Language Processing (ICSLP 2000), Oct. 16-20, 2000, Beijing International Convention Center, Beijing, China. 2000

4. Speaker Characteristic-based Acoustic Model Adaptation Method for Speaker Recognition Systems [D]. Millington, Daniel S. 2011

5. Incremental Change or Initial Differences? Testing Two Models of Marital Deterioration [O]. Justin A. Lavner, Thomas N. Bradbury, Benjamin R. Karney

6. Online Adaptation of Word-initial Ukrainian CC Consonant Clusters by Native Speakers of English [O]. Kateryna Laidler. 2017
