首页> 外文学位 >Optimal delayed decisions in encoding and decoding of audio signals and general sources.
【24h】

Optimal delayed decisions in encoding and decoding of audio signals and general sources.

机译:音频信号和一般信号源的编码和解码中的最佳延迟决策。

获取原文
获取原文并翻译 | 示例

摘要

This dissertation is concerned with algorithms that optimally exploit delay for encoding or decoding decisions in certain common scenarios involving signal compression.;In applications that involve off-line encoding, such as movie-streaming over the internet, music playback from hand-held devices, and so on, the end-user is not sensitive to encoding delay. Despite this fact, encoders typically compress frame after frame of the signal, thereby restricting encoding delay. As one focus of this dissertation, delayed-decision approaches are explored, to optimize the encoding operation over the entire signal. Standards based audio-compression is chosen as the candidate setting to demonstrate the benefits of the concept. A two-layered trellis effectively optimizes both intra- and inter-frame encoding decisions while minimizing a psychoacoustically relevant distortion measure under a prescribed bit-rate constraint. The bit-stream produced is standard compatible and there is no additional decoding delay. As an accompaniment to this rate-distortion optimization paradigm, and motivated by it, modifications are proposed to the audio distortion metric itself that enhance its psychoacoustic relevance, and endeavor to enable subjectively optimal decisions.;Subsequently the focus shifts to delay at the decoder end of the compression chain. Unlike at the encoder, there are no parameter choices to make. But can the decoder, by suitable application of delay, exploit correlations if any with future frames to improve the reconstruction of the current frame? This question is particularly relevant in predictive coding scenarios, where a correlated source model is explicitly assumed. The encoder predicts the current sample from the past, and codes the prediction residual. Correlations with future samples can be exploited at the decoder end, for instance by applying a non-causal filter to smooth the regular zero-delay reconstructions. In contrast, this dissertation proposes an estimation-theoretic framework where conditional probability densities, given both past and available future information (for a fixed delay), are recursively calculated, and optimal reconstruction computed via conditional expectation. This optimal delayed decoder in turn motivates a near-optimal low complexity approximation, that employs a time-invariant look-up table or codebook approach. Applications include video compression employing motion compensated prediction, and so called 'low-delay' applications, where predictive coding is used in lieu of transform coding to avoid large framing delays and encoding complexity.
机译:本发明涉及在某些涉及信号压缩的常见情况下,最优地利用延迟来进行编码或解码决策的算法。在涉及离线编码的应用中,例如互联网上的电影流,从手持设备播放音乐,依此类推,最终用户对编码延迟不敏感。尽管有这个事实,编码器通常会在信号的帧之后压缩帧,从而限制了编码延迟。作为本论文的重点之一,探索了延迟决策方法,以优化整个信号上的编码操作。选择基于标准的音频压缩作为候选设置,以演示该概念的好处。两层网格有效地优化了帧内和帧间编码决策,同时在规定的比特率约束下将心理听觉上相关的失真度量最小化。产生的比特流是标准兼容的,没有额外的解码延迟。作为此速率失真优化范例的一种伴奏,并以此为动机,提出了对音频失真度量本身的修改,以增强其心理声学相关性,并努力实现主观上的最佳决策。随后,焦点转移到解码器端以延迟压缩链。与编码器不同,没有参数可供选择。但是,解码器是否可以通过适当地应用延迟来利用与未来帧之间的相关性(如果有)来改善当前帧的重构?这个问题在预测性编码方案中特别重要,在该方案中,显式假定了相关的源模型。编码器根据过去预测当前样本,并对预测残差进行编码。可以在解码器端利用与未来样本的相关性,例如通过应用非因果滤波器来平滑常规零延迟重构。相比之下,本文提出了一种估计理论框架,其中递归计算既有过去信息又有可用未来信息(针对固定的延迟)的条件概率密度,并通过条件期望来计算最优重构。该最佳延迟解码器又促使采用接近最佳的低复杂度近似,该近似采用时不变查找表或码本方法。应用包括采用运动补偿预测的视频压缩,以及所谓的“低延迟”应用,其中使用预测编码代替变换编码来避免大的成帧延迟和编码复杂性。

著录项

  • 作者单位

    University of California, Santa Barbara.;

  • 授予单位 University of California, Santa Barbara.;
  • 学科 Engineering Computer.;Engineering Electronics and Electrical.;Information Technology.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 147 p.
  • 总页数 147
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号