首页> 外国专利> Method and system for microphone array input type speech recognition using band-pass power distribution for sound source position/direction estimation

Method and system for microphone array input type speech recognition using band-pass power distribution for sound source position/direction estimation

机译:使用带通功率分布进行声源位置/方向估计的麦克风阵列输入类型语音识别的方法和系统

摘要

A microphone array input type speech recognition scheme capable of realizing a high precision sound source position or direction estimation by a small amount of calculations, and thereby realizing a high precision speech recognition. A band-pass waveform, which is a waveform for each frequency bandwidth, is obtained from input signals of the microphone array, and a band-pass power of the sound source is directly obtained from the band-pass waveform. Then, the obtained band- pass power is used as the speech parameter. It is also possible to realize the sound source estimation and the band-pass power estimation at high precision while further reducing an amount of calculations, by utilizing a sound source position search processing in which a low resolution position estimation and a high resolution position estimation are combined.
机译:麦克风阵列输入型语音识别方案,能够通过少量的计算来实现高精度的声源位置或方向估计,从而实现高精度的语音识别。从麦克风阵列的输入信号获得作为每个频率带宽的波形的带通波形,并且直接从该带通波形获得声源的带通功率。然后,将获得的带通功率用作语音参数。通过利用其中低分辨率位置估计和高分辨率位置估计为零的声源位置搜索处理,还可以在进一步减少计算量的同时高精度地实现声源估计和带通功率估计。结合。

著录项

  • 公开/公告号US6009396A

    专利类型

  • 公开/公告日1999-12-28

    原文格式PDF

  • 申请/专利权人 KABUSHIKI KAISHA TOSHIBA;

    申请/专利号US19970818672

  • 发明设计人 YOSHIFUMI NAGATA;

    申请日1997-03-14

  • 分类号G10L3/00;

  • 国家 US

  • 入库时间 2022-08-22 01:38:25

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号