首页> 外国专利> Self-learning speaker adaptation based on spectral bias source decomposition, using very short calibration speech

Self-learning speaker adaptation based on spectral bias source decomposition, using very short calibration speech

机译:基于频谱偏置源分解的自学扬声器自适应,使用非常短的校准语音

摘要

A speaker adaptation technique based on the separation of speech spectra variation sources is developed for improving speaker- independent continuous speech recognition. The variation sources include speaker acoustic characteristics, and contextual dependency of allophones. Statistical methods are formulated to normalize speech spectra based on speaker acoustic characteristics and then adapt mixture Gaussian density phone models based on speaker phonologic characteristics. Adaptation experiments using short calibration speech (5 sec./speaker) have shown substantial performance improvement over the baseline recognition system.
机译:开发了一种基于语音频谱变化源分离的说话人自适应技术,以改善独立于说话人的连续语音识别。变化源包括扬声器的声学特性和同音素的上下文相关性。制定了统计方法,以根据说话者的声学特性对语音频谱进行归一化,然后根据说话者的语音特性调整混合高斯密度电话模型。使用简短的校准语音(每个扬声器5秒)的适应性实验显示,与基准识别系统相比,其性能有了显着提高。

著录项

  • 公开/公告号US5794192A

    专利类型

  • 公开/公告日1998-08-11

    原文格式PDF

  • 申请/专利权人 PANASONIC TECHNOLOGIES INC.;

    申请/专利号US19960712802

  • 发明设计人 YUNXIN ZHAO;

    申请日1996-09-12

  • 分类号G10L5/06;G10L7/08;

  • 国家 US

  • 入库时间 2022-08-22 02:38:54

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号