首页> 外文学位 >Spectral analysis methods for automatic speech recognition applications.
【24h】

Spectral analysis methods for automatic speech recognition applications.

机译:用于自动语音识别应用程序的频谱分析方法。

获取原文
获取原文并翻译 | 示例

摘要

In this thesis, we evaluate the front-end of Automatic Speech Recognition (ASR) systems, with respect to different types of spectral processing methods that are extensively used. A filter bank approach for front end spectral analysis is one of the common methods used for spectral analysis. In this work we describe and evaluate spectral analysis based on Mel and Gammatone filter banks. These filtering methods are derived from auditory models and are thought to have some advantages for automatic speech recognition work. Experimentally, however, we show that direct use of FFT spectral values is just as effective as using either Mel or Gammatone filter banks, provided that the features extracted from the FFT spectral values take into account a Mel or Mel-like frequency scale. It is also shown that trajectory features based on sliding block of spectral features, computed using either FFT or filter bank spectral analysis are considerably more effective, in terms of ASR accuracy, than are delta and delta-delta terms often used for ASR. Although there is no major performance disadvantage to using a filter bank, simplicity of analysis is a reason to eliminate this step in speech processing. These assertions hold for both clean and noisy speech.
机译:在本文中,我们针对广泛使用的不同类型的频谱处理方法,评估了自动语音识别(ASR)系统的前端。用于前端光谱分析的滤波器组方法是用于光谱分析的常用方法之一。在这项工作中,我们描述和评估基于Mel和Gammatone滤波器组的光谱分析。这些过滤方法是从听觉模型中得出的,并被认为在自动语音识别工作中具有一些优势。然而,通过实验,我们表明,直接使用FFT频谱值与使用Mel或Gammatone滤波器组一样有效,只要从FFT频谱值中提取的特征考虑了Mel或类似Mel的频率标度即可。还显示出,在ASR精度方面,使用FFT或滤波器组频谱分析计算的基于频谱特征滑动块的轨迹特征比ASR经常使用的增量和增量-增量术语有效得多。尽管使用滤波器组没有主要的性能劣势,但是分析的简单性是消除语音处理中此步骤的原因。这些主张对于干净的和嘈杂的讲话都适用。

著录项

  • 作者单位

    State University of New York at Binghamton.;

  • 授予单位 State University of New York at Binghamton.;
  • 学科 Engineering Electronics and Electrical.;Speech Communication.;Physics Acoustics.
  • 学位 M.S.
  • 年度 2013
  • 页码 97 p.
  • 总页数 97
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 水产、渔业;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号