Improved Psychoacoustic Model for Efficient Perceptual Audio Codecs

机译：有效听觉音频编解码器的改进心理声学模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Since early perceptual audio coders such as mp3, the underlying psychoacoustic model that controls the encoding process has not undergone many dramatic changes. Meanwhile, modern audio coders have been equipped with semi-parametric or parametric coding tools such as audio bandwidth extension. Thereby, the initial psychoacoustic model used in a perceptual coder, just considering added quantisation noise, became partly unsuitable. We propose the use of an improved psychoacoustic excitation model based on an existing model devised by Dau et al. in 1997. This modulation based model is essentially independent from the precise input waveform by calculating an internal auditory representation. Using the example of MPEG-H 3D Audio and its semi-parametric Intelligent Gap Filling (IGF) tool, we demonstrate that we can successfully control the IGF parameter selection process to achieve overall improved perceptual quality.

机译：自从诸如mp3之类的早期感知音频编码器以来，控制编码过程的底层心理声学模型并未发生太多戏剧性的变化。同时，现代音频编码器已经配备了半参数或参数编码工具，例如音频带宽扩展。因此，仅考虑增加的量化噪声，在感知编码器中使用的初始心理声学模型变得部分不合适。我们建议使用基于Dau等人设计的现有模型的改进的心理声学激励模型。 1997年，这种基于调制的模型通过计算内部听觉表示，基本上独立于精确的输入波形。使用MPEG-H 3D音频及其半参数智能间隙填充（IGF）工具的示例，我们证明了我们可以成功地控制IGF参数选择过程，以实现整体上改善的感知质量。

著录项

来源
《Audio Engineering Society international convention》|2018年|214-223|共10页
会议地点
作者
Sascha Disch; Steven van de Par; Andreas Niedermeier; Elena Burdiel Perez; Ane Berasategui Ceberio; Bernd Edler;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Audio Compression Codec Using a Dynamic Gammachirp Psychoacoustic Model And a D.W.T Multiresolution Analysis [J] . Khalil Abid, Kais Ouni, Noureddine Ellouze International Journal on Computer Science and Engineering . 2010,第4期

机译：动态Gammachirp心理声学模型和D.W.T多分辨率分析的音频压缩编解码器
2. Perceptual filter design for audio coding using psychoacoustic modelling [J] . Lam Y.H., Stewart R.W. Electronics Letters . 1998,第8期

机译：使用心理声学建模的音频编码感知滤波器设计
3. Perceptual Spatial Audio Recording, Simulation, and Rendering: An overview of spatial-audio techniques based on psychoacoustics [J] . Huseyin Hacihabiboglu, Enzo De Sena, Zoran Cvetkovic, IEEE Signal Processing Magazine . 2017,第3期

机译：感知空间音频记录，模拟和渲染：基于心理声学的空间音频技术概述
4. Improved Psychoacoustic Model for Efficient Perceptual Audio Codecs [C] . Sascha Disch, Steven van de Par, Andreas Niedermeier, Audio Engineering Society Convention . 2018

机译：改进了高效感知音频编解码器的心理声学模型
5. Embedding perceptual linear prediction models in speech and audio coding. [D] . Atti, Venkatraman S. 2006

机译：在语音和音频编码中嵌入感知线性预测模型。
6. Musical Training Improves Audiovisual Integration Capacity under Conditions of High Perceptual Load [O] . Jonathan M. P. Wilbiks, Courtney O’Brien 2020

机译：音乐训练可提高高知觉负载条件下的视听整合能力
7. Warped linear prediction for improved perceptual quality in the SCELP low delay audio codec [O] . Krüger Hauke, Vary Peter 2007

机译：扭曲线性预测可提高SCELP低延迟音频编解码器的感知质量

Improved Psychoacoustic Model for Efficient Perceptual Audio Codecs

摘要

著录项

相似文献

相关主题

期刊订阅