Optimal delayed decisions in encoding and decoding of audio signals and general sources.

Abstract

This dissertation is concerned with algorithms that optimally exploit delay for encoding or decoding decisions in certain common scenarios involving signal compression.

In applications that involve off-line encoding, such as movie streaming over the internet or music playback from hand-held devices, the end user is not sensitive to encoding delay. Despite this, encoders typically compress the signal one frame at a time, thereby restricting encoding delay. As one focus of this dissertation, delayed-decision approaches are explored to optimize the encoding operation over the entire signal. Standards-based audio compression is chosen as the candidate setting to demonstrate the benefits of the concept. A two-layered trellis effectively optimizes both intra- and inter-frame encoding decisions while minimizing a psychoacoustically relevant distortion measure under a prescribed bit-rate constraint. The bit-stream produced is standard-compatible and there is no additional decoding delay. As an accompaniment to this rate-distortion optimization paradigm, and motivated by it, modifications are proposed to the audio distortion metric itself that enhance its psychoacoustic relevance and endeavor to enable subjectively optimal decisions.

Subsequently the focus shifts to delay at the decoder end of the compression chain. Unlike at the encoder, there are no parameter choices to make. But can the decoder, by suitable application of delay, exploit correlations, if any, with future frames to improve the reconstruction of the current frame? This question is particularly relevant in predictive coding scenarios, where a correlated source model is explicitly assumed. The encoder predicts the current sample from the past and codes the prediction residual. Correlations with future samples can be exploited at the decoder end, for instance by applying a non-causal filter to smooth the regular zero-delay reconstructions. In contrast, this dissertation proposes an estimation-theoretic framework in which conditional probability densities, given both past and available future information (for a fixed delay), are recursively calculated, and the optimal reconstruction is computed via conditional expectation. This optimal delayed decoder in turn motivates a near-optimal, low-complexity approximation that employs a time-invariant look-up table or codebook approach. Applications include video compression employing motion-compensated prediction, and so-called 'low-delay' applications, where predictive coding is used in lieu of transform coding to avoid large framing delays and encoding complexity.
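
The encoder-side idea in the abstract, choosing per-frame encoding decisions jointly over the whole signal rather than one frame at a time, can be illustrated with a generic Viterbi search over a trellis of frames. The sketch below is not the dissertation's two-layered trellis or its psychoacoustic distortion measure: it is a simplified single-layer version that minimizes a Lagrangian cost (distortion plus lam times bits) for a fixed multiplier. The names trellis_search, dist, rate, trans_bits and lam are all hypothetical, and the inter-frame bit cost is an assumed stand-in for whatever dependency a real codec would have between consecutive frames.

```python
from typing import Callable, List, Sequence, Tuple


def trellis_search(
    dist: Sequence[Sequence[float]],
    rate: Sequence[Sequence[float]],
    trans_bits: Callable[[int, int], float],
    lam: float,
) -> Tuple[List[int], float]:
    """Pick one encoding option per frame to minimize total D + lam * R.

    dist[t][k] and rate[t][k] are the (assumed, precomputed) distortion and
    bit cost of option k in frame t. trans_bits(j, k) is a hypothetical extra
    bit cost for moving from option j in one frame to option k in the next.
    """
    n_frames = len(dist)
    n_opts = len(dist[0])

    # Best cost of any path ending in each option of the first frame.
    cost = [dist[0][k] + lam * rate[0][k] for k in range(n_opts)]
    back: List[List[int]] = []  # back[t-1][k] = best predecessor of option k in frame t

    for t in range(1, n_frames):
        new_cost = []
        ptr = []
        for k in range(n_opts):
            # Cheapest way to arrive at option k from any option in frame t-1.
            best_j = min(range(n_opts), key=lambda j: cost[j] + lam * trans_bits(j, k))
            arrive = cost[best_j] + lam * trans_bits(best_j, k)
            new_cost.append(arrive + dist[t][k] + lam * rate[t][k])
            ptr.append(best_j)
        cost = new_cost
        back.append(ptr)

    # Trace the cheapest full path back from the last frame.
    k = min(range(n_opts), key=cost.__getitem__)
    total = cost[k]
    path = [k]
    for ptr in reversed(back):
        k = ptr[k]
        path.append(k)
    path.reverse()
    return path, total


if __name__ == "__main__":
    # Toy data: 3 frames, 2 options each; switching options costs 3 bits.
    dist = [[4, 1], [1, 3], [4, 1]]
    rate = [[1, 2], [1, 2], [1, 2]]
    path, total = trellis_search(dist, rate, lambda j, k: 3 * abs(j - k), lam=1)
    print(path, total)
```

On this toy data a frame-by-frame choice would pick the locally cheapest option in each frame and pay the switching cost twice, whereas the search over the whole signal stays on one option throughout. In a rate-constrained setting, lam would typically be adjusted in an outer loop until the total bits meet the target; that loop is omitted here.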

Bibliographic record

Author: Melkote Krishnaprasad, Vinay
Author affiliation: University of California, Santa Barbara
Degree-granting institution: University of California, Santa Barbara
Subject: Engineering Computer; Engineering Electronics and Electrical; Information Technology
Degree: Ph.D.
Year: 2010
Pages: 147
Original format: PDF
Language: English
