首页> 外文学位 >Rover based constrained iterative speech enhancement.
【24h】

Rover based constrained iterative speech enhancement.

机译:基于漫游者的约束迭代语音增强。

获取原文
获取原文并翻译 | 示例

摘要

The degree of impact of environmental noise over phonemes is not uniform since it is dependent on their distinct acoustical properties. The criterion to decrease the impact of noise on speech varies across phonemes. The objective of this study is to suppress noise selectively from speech based on the knowledge of the phonemes, thereby producing an overall speech signal with improved speech quality.; A ROVER based enhancement framework is proposed in this study which employs Auto-LSP, a constrained iterative speech enhancement algorithm, as its baseline system. There are three significant contributions made from this study. First, the new algorithm addresses the issue of dependency of terminating iteration in Auto-LSP by generating a small set of enhanced utterances for every noisy utterance and selecting the best segments from this set using a hard decision scheme based on phoneme class specific constraints. Second, a soft decision method is also formulated to alleviate the effect of errors made in hard decisions. Finally, auditory masking threshold based enhancement technique is integrated into the hard and soft decision methods. The proposed enhancement algorithms are evaluated over the TIMIT speech corpus, using objective quality assessment tests based on the Itakura-Saito distortion, segmental SNR, and PESQ metrics. Results demonstrate that the proposed algorithms are more effective in achieving improved levels of speech quality for almost all phoneme classes and noise types considered in this study.
机译:环境噪声对音素的影响程度并不统一,因为它取决于音素的独特声学特性。降低噪声对语音的影响的标准因音素而异。该研究的目的是基于音素的知识选择性地抑制语音中的噪声,从而产生具有改善语音质量的整体语音信号。在这项研究中提出了一个基于ROVER的增强框架,该框架采用约束迭代语音增强算法Auto-LSP作为其基线系统。这项研究做出了三项重大贡献。首先,新算法通过为每个嘈杂的话语生成一小组增强的话语并使用基于音素类特定约束的硬决策方案从该集中选择最佳段,从而解决了Auto-LSP中终止迭代的依赖性问题。其次,还制定了软决策方法来减轻硬决策中错误的影响。最后,基于听觉掩蔽阈值的增强技术被集成到硬决策和软决策方法中。使用基于Itakura-Saito失真,分段SNR和PESQ指标的客观质量评估测试,对TIMIT语音语料库评估提出的增强算法。结果表明,对于本研究中考虑的几乎所有音素类别和噪声类型,所提出的算法在提高语音质量方面均更为有效。

著录项

  • 作者

    Das, Amit.;

  • 作者单位

    University of Colorado at Boulder.;

  • 授予单位 University of Colorado at Boulder.;
  • 学科 Engineering Electronics and Electrical.
  • 学位 M.S.
  • 年度 2007
  • 页码 66 p.
  • 总页数 66
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 无线电电子学、电信技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号