Multiple Regression of Log Spectra for In-Car Speech Recognition Using Multiple Distributed Microphones

Weifeng LI; Tetsuya SHINDE; Hiroshi FUJIMURA; Chiyomi MIYAJIMA; Takanori NISHINO; Katunobu ITOU; Kazuya TAKEDA; Fumitada ITAKURA

首页> 外文期刊>IEICE Transactions on Information and Systems >Multiple Regression of Log Spectra for In-Car Speech Recognition Using Multiple Distributed Microphones

【24h】

Multiple Regression of Log Spectra for In-Car Speech Recognition Using Multiple Distributed Microphones

机译：对数谱的多元回归，用于使用多个分布式麦克风进行车内语音识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes a new multi-channel method of noisy speech recognition, which estimates the log spectrum of speech at a close-talking microphone based on the multiple regression of the log spectra (MRLS) of noisy signals captured by distributed microphones. The advantages of the proposed method are as follows: 1) The method does not require a sensitive geometric layout, calibration of the sensors nor additional pre-processing for tracking the speech source; 21 System works in very small computation amounts; and 3) Regression weights can be statistically optimized over the given training data. Once the optimal regression weights are obtained by regression learning, they can be utilized to generate the estimated log spectrum in the recognition phase, where the speech of close-talking is no longer required. The performance of the proposed method is illustrated by speech recognition of real in-car dialogue data. In comparison to the nearest distant microphone and multi-microphone adaptive beamformer, the proposed approach obtains relative word error rate (WER) reductions of 9.8% and 3.6%, respectively.

机译：本文介绍了一种新的多通道噪声语音识别方法，该方法基于分布式麦克风捕获的噪声信号的对数谱（MRLS）的多元回归来估计近距离麦克风的语音对数谱。所提出的方法的优点如下：1）该方法不需要敏感的几何布局，传感器的校准或用于跟踪语音源的附加预处理。 21系统以很小的计算量工作； 3）可以根据给定的训练数据对回归权重进行统计优化。一旦通过回归学习获得了最佳回归权重，就可以将其用于在识别阶段生成估计的对数谱，此时不再需要近距离交谈的语音。通过对真实车内对话数据的语音识别来说明所提出方法的性能。与最近的远距离麦克风和多麦克风自适应波束形成器相比，该方法获得的相对字误码率（WER）降低分别为9.8％和3.6％。

著录项

来源
《IEICE Transactions on Information and Systems》 |2005年第3期|p.384-390|共7页
作者
Weifeng LI; Tetsuya SHINDE; Hiroshi FUJIMURA; Chiyomi MIYAJIMA; Takanori NISHINO; Katunobu ITOU; Kazuya TAKEDA; Fumitada ITAKURA;
展开▼
作者单位

Department of Information Electronics, Graduate School of Engineering, Nagoya University, Nagoya-shi, 464—8603 Japan;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词
speech recognition; microphone arrays; adaptive beamforming; signal-to-deviation ratio; multiple regression;

机译：语音识别麦克风阵列自适应波束形成信噪比多元回归;

相似文献

外文文献
中文文献
专利

1. Adaptive log-spectral regression for in-car speech recognition using multiple distributed microphones [J] . Weifeng Li, Takeda K., Itakura F. IEEE signal processing letters . 2005,第4期

机译：使用多个分布式麦克风的自适应对数谱回归用于车内语音识别
2. Adaptive Nonlinear Regression Using Multiple Distributed Microphones for In-Car Speech Recognition [J] . Weifeng LI, Chiyomi MIYAJIMA, Takanori NISHINO, IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences . 2005,第7期

机译：使用多个分布式麦克风的自适应非线性回归用于车内语音识别
3. In-car speech recognition based on multiple regression of spectra [J] . Tetsuya Shinde, Kazuya Takeda, Fumitada Itakura 電子情報通信学会技術研究報告. 音声. Speech . 2002,第33期

机译：基于频谱多元回归的车内语音识别
4. Optimizing Regression for in-car Speech Recognition using Multiple Distributed Microphones [C] . Weifeng Li, Kazuya Takeda, Fumitada Itakura International Conference on Spoken Language Processing; 20041004-08; Jeju(KR) . 2004

机译：使用多个分布式麦克风为车内语音识别优化回归
5. A COMPARISON OF SIX MODELS FOR PREDICTING CORPORATE BANKRUPTCY: MULTIPLE LINEAR REGRESSION ANALYSIS, MULTIPLE LINEAR DISCRIMINANT ANALYSIS, STEPWISE REGRESSION ANALYSIS, STEPWISE DISCRIMINANT ANALYSIS, MULTIPLE LINEAR REGRESSION ANALYSIS WITH RIDGE REGRESSION, AND MULTIPLE LINEAR DISCRIMINANT ANALYSIS WITH BIASED MINIMUM CHI-SQUARE RULE [D] . MAPP, JOHNNIE ALBERT. 1981

机译：六种预测公司破产的模型的比较：多个线性回归分析，多个线性判别分析，逐步回归分析，逐步判别分析，多个带岭点回归的线性回归分析，以及多个线性离散
6. An improved peptide-spectral matching algorithm through distributed search over multiple cores and multiple CPUs [O] . Jian Sun, Bolin Chen, Fang-Xiang Wu 2014

机译：通过在多个核和多个CPU上进行分布式搜索改进了肽谱匹配算法
7. Robust In-Car Speech Recognition Based on Nonlinear Multiple Regressions [O] . Weifeng Li, Kazuya Takeda, Fumitada Itakura 2007

机译：基于非线性多元回归的鲁棒汽车语音识别
8. Speech Recognition Using Multiple Features and Multiple Recognizers [R] . Rathbun, T. F. 1991

机译：使用多个特征和多个识别器的语音识别

Multiple Regression of Log Spectra for In-Car Speech Recognition Using Multiple Distributed Microphones

摘要

著录项

相似文献

相关主题

期刊订阅