首页> 外国专利> COMPOSITE MODEL GENERATING DEVICE FOR VOICE AND IMAGE, ENVIRONMENT ADAPTING DEVICE FOR COMPOSITE MODEL OF VOICE AND IMAGE, AND VOICE RECOGNIZING DEVICE

COMPOSITE MODEL GENERATING DEVICE FOR VOICE AND IMAGE, ENVIRONMENT ADAPTING DEVICE FOR COMPOSITE MODEL OF VOICE AND IMAGE, AND VOICE RECOGNIZING DEVICE

机译:语音和图像的复合模型生成装置,语音和图像的复合模型的环境自适应装置以及语音识别装置

摘要

PROBLEM TO BE SOLVED: To provide a composite model generating device for voice and image for voice recognizing device which can performs voice recognition at a high voice recognition rate and the voice recognizing device. SOLUTION: In the composite model generating device 100 for voice and image, an HMM composition part 16 computes the products of the output probabilities of the voice and image in all combinations of states of a voice HMM and an image HMM and generates and composites a composite HMM having a composited Gaussian mixture distribution including the products of the output probabilities in the respective states. Then, an HMM learning part 17 performs connected learning maximizing the output likelihood by using a labeled AV signal in a learning AV data memory 31 according to the generated and composite HMM to generate a composite HMM of the learnt voice and image. A voice recognition part 200 of the voice recognizing device 200 performs voice recognition by using the composite HMM of the learnt voice and image according to the feature quantity of a feature-extracted spoken voice signal and the feature quantity of an image signal.
机译:解决的问题:提供一种能够以高语音识别率执行语音识别的用于语音和图像的复合模型生成装置和语音识别装置。解决方案:在用于语音和图像的合成模型生成设备100中,HMM合成部分16计算语音HMM和图像HMM的状态的所有组合中语音和图像的输出概率的乘积,并生成并合成复合图像。具有复合高斯混合分布的HMM,其中包括各个状态下输出概率的乘积。然后,HMM学习部17根据所生成的合成HMM,通过在学习AV数据存储器31中使用标记的AV信号来执行最大化输出似然性的连接学习,以生成所学习的语音和图像的合成HMM。语音识别装置200的语音识别部分200根据特征提取的语音信号的特征量和图像信号的特征量,使用学习的语音和图像的复合HMM来执行语音识别。

著录项

  • 公开/公告号JP2002169586A

    专利类型

  • 公开/公告日2002-06-14

    原文格式PDF

  • 申请/专利权人 ATR ONSEI GENGO TSUSHIN KENKYUSHO:KK;

    申请/专利号JP20000385184

  • 发明设计人 KUMAGAI KENICHI;NAKAMURA SATORU;

    申请日2000-12-19

  • 分类号G10L15/14;G06N3/00;G06T7/00;G10L15/06;G10L15/24;

  • 国家 JP

  • 入库时间 2022-08-22 00:59:09

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号