首页> 外文期刊>IEICE Transactions on Information and Systems >Multiple Regression of Log Spectra for In-Car Speech Recognition Using Multiple Distributed Microphones
【24h】

Multiple Regression of Log Spectra for In-Car Speech Recognition Using Multiple Distributed Microphones

机译:对数谱的多元回归,用于使用多个分布式麦克风进行车内语音识别

获取原文
获取原文并翻译 | 示例
           

摘要

This paper describes a new multi-channel method of noisy speech recognition, which estimates the log spectrum of speech at a close-talking microphone based on the multiple regression of the log spectra (MRLS) of noisy signals captured by distributed microphones. The advantages of the proposed method are as follows: 1) The method does not require a sensitive geometric layout, calibration of the sensors nor additional pre-processing for tracking the speech source; 21 System works in very small computation amounts; and 3) Regression weights can be statistically optimized over the given training data. Once the optimal regression weights are obtained by regression learning, they can be utilized to generate the estimated log spectrum in the recognition phase, where the speech of close-talking is no longer required. The performance of the proposed method is illustrated by speech recognition of real in-car dialogue data. In comparison to the nearest distant microphone and multi-microphone adaptive beamformer, the proposed approach obtains relative word error rate (WER) reductions of 9.8% and 3.6%, respectively.
机译:本文介绍了一种新的多通道噪声语音识别方法,该方法基于分布式麦克风捕获的噪声信号的对数谱(MRLS)的多元回归来估计近距离麦克风的语音对数谱。所提出的方法的优点如下:1)该方法不需要敏感的几何布局,传感器的校准或用于跟踪语音源的附加预处理。 21系统以很小的计算量工作; 3)可以根据给定的训练数据对回归权重进行统计优化。一旦通过回归学习获得了最佳回归权重,就可以将其用于在识别阶段生成估计的对数谱,此时不再需要近距离交谈的语音。通过对真实车内对话数据的语音识别来说明所提出方法的性能。与最近的远距离麦克风和多麦克风自适应波束形成器相比,该方法获得的相对字误码率(WER)降低分别为9.8%和3.6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号