Journal: International journal of simulation: systems, science and technology

A NOVEL SPEECH COMPRESSION TECHNIQUE USING OPTIMIZED WAVELET TRANSFORM TO IMPROVE THE QUALITY OF AUDITORY PERCEPTION UNDER LOW SNR CONDITIONS



Abstract

Speech compression in poor acoustic environments, where the signal energy is weak due to acoustical disturbances, can improve transmission efficiency and reduce bandwidth, provided that intelligibility and/or quality are preserved by selecting appropriate energy-based wavelets. We propose an Optimized Wavelet Transform (OWT) that improves speech perception by incorporating masking techniques into the algorithm to reduce the effect of noise. Adaptive wavelet selection followed by optimized quantization exploits a robust Dynamic Dictionary Scheme (DDS) to perform efficient compression while preserving speech intelligibility and perceptual quality. An additional lossless coding stage further increases the compression ratio while preserving the quality of the signal. Finally, the decompressed signal undergoes tonal and noise masking by applying a global threshold based on the Sub-Band Perceptual Factor (SBPF) and Perceptual Entropy (PE), which improves the quality of the signal. The performance of the proposed algorithm is evaluated in terms of Normalized Root-Mean-Square Error (NRMSE), Compression Ratio (CR), Perceptual Evaluation of Speech Quality (PESQ), Reconstruction Distortion Length (RDL), and Signal-to-Noise Ratio (SNR) for various voiced and unvoiced signals recorded in low-SNR conditions. The signals are taken from the NOIZEUS database, and some samples are recorded and normalized to operate at a sampling frequency of 8 kHz.
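As a rough illustration of the kind of pipeline the abstract outlines (wavelet decomposition, a global threshold, reconstruction, then NRMSE, CR and SNR), the following is a minimal sketch and not the authors' OWT. Everything specific in it is an assumption made for illustration: the Haar wavelet, the `keep_fraction` parameter, the synthetic 8 kHz test signal, the NRMSE and SNR formulas, and the definition of CR as total coefficients divided by retained nonzero coefficients. Adaptive wavelet selection, the DDS, lossless coding and perceptual masking are not modelled.

```python
import numpy as np

def haar_dwt(x, levels):
    """Multi-level orthonormal Haar DWT; len(x) must be divisible by 2**levels."""
    coeffs = []
    approx = np.asarray(x, dtype=float)
    for _ in range(levels):
        even, odd = approx[0::2], approx[1::2]
        coeffs.append((even - odd) / np.sqrt(2.0))  # detail band
        approx = (even + odd) / np.sqrt(2.0)        # approximation band
    coeffs.append(approx)
    return coeffs  # [d1, d2, ..., dL, aL]

def haar_idwt(coeffs):
    """Inverse of haar_dwt."""
    approx = coeffs[-1]
    for detail in reversed(coeffs[:-1]):
        out = np.empty(2 * approx.size)
        out[0::2] = (approx + detail) / np.sqrt(2.0)
        out[1::2] = (approx - detail) / np.sqrt(2.0)
        approx = out
    return approx

def global_threshold(coeffs, keep_fraction):
    """Zero every coefficient below one global magnitude threshold (assumed rule)."""
    flat = np.concatenate(coeffs)
    k = max(1, int(keep_fraction * flat.size))
    thr = np.sort(np.abs(flat))[-k]  # k-th largest magnitude
    return [np.where(np.abs(c) >= thr, c, 0.0) for c in coeffs]

def nrmse(x, r):
    # Assumed definition: error energy over mean-removed signal energy.
    return np.sqrt(np.sum((x - r) ** 2) / np.sum((x - np.mean(x)) ** 2))

def snr_db(x, r):
    # Assumed definition: signal energy over reconstruction-error energy, in dB.
    return 10.0 * np.log10(np.sum(x ** 2) / np.sum((x - r) ** 2))

def compression_ratio(coeffs):
    # Assumed definition: total coefficients / retained nonzero coefficients.
    flat = np.concatenate(coeffs)
    return flat.size / np.count_nonzero(flat)

# Synthetic stand-in for a noisy speech frame sampled at 8 kHz (hypothetical data).
fs = 8000
t = np.arange(1024) / fs
rng = np.random.default_rng(0)
x = np.sin(2 * np.pi * 200 * t) + 0.5 * np.sin(2 * np.pi * 450 * t)
x = x + 0.05 * rng.standard_normal(t.size)

coeffs = haar_dwt(x, levels=4)
kept = global_threshold(coeffs, keep_fraction=0.25)
recon = haar_idwt(kept)

print("CR   :", compression_ratio(kept))
print("NRMSE:", nrmse(x, recon))
print("SNR  :", snr_db(x, recon), "dB")
```

Lowering `keep_fraction` raises the CR at the cost of a higher NRMSE and lower SNR, which is the trade-off the listed metrics are meant to expose. PESQ and RDL are omitted because they require a perceptual model or definitions that the abstract does not give.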
