...
首页> 外文期刊>Journal of Zhejiang university science >High throughput VLSI architecture for H.264/AVC context-based adaptive binary arithmetic coding (CABAC) decoding
【24h】

High throughput VLSI architecture for H.264/AVC context-based adaptive binary arithmetic coding (CABAC) decoding

机译:高吞吐量VLSI架构,用于基于H.264 / AVC上下文的自适应二进制算术编码(CABAC)解码

获取原文
           

摘要

context-based adaptive binary arithmetic coding (CABAC) is the major entropy-coding algorithm employed in h.264/AVC. In this paper, we present a new VLSI architecture design for an h.264/AVC CABAC decoder, which optimizes both decode decision and decode bypass engines for high throughput, and improves context model allocation for efficient external memory access. Based on the fact that the most possible symbol (MPS) branch is much simpler than the least possible symbol (LPS) branch, a newly organized decode decision engine consisting of two serially concatenated MPS branches and one LPS branch is proposed to achieve better parallelism at lower timing path cost. A look-ahead context index (ctxIdx) calculation mechanism is designed to provide the context model for the second MPS branch. A head-zero detector is proposed to improve the performance of the decode bypass engine according to UEGk encoding features. In addition, to lower the frequency of memory access, we reorganize the context models in external memory and use three circular buffers to cache the context models, neighboring information, and bit stream, respectively. A pre-fetching mechanism with a prediction scheme is adopted to load the corresponding content to a circular buffer to hide external memory latency. Experimental results show that our design can operate at 250 MHz with a 20.71k gate count in SMIC18 silicon technology, and that it achieves an average data decoding rate of 1.5 bins/cycle.
机译:基于上下文的自适应二进制算术编码(CABAC)是h.264 / AVC中使用的主要熵编码算法。在本文中,我们为h.264 / AVC CABAC解码器提供了一种新的VLSI架构设计,该设计可优化解码决策和解码旁路引擎以实现高吞吐量,并改善上下文模型分配以实现有效的外部存储器访问。基于最可能的符号(MPS)分支比最不可能的符号(LPS)分支简单得多的事实,提出了一种新组织的由两个串行串联的MPS分支和一个LPS分支组成的解码决策引擎,以在降低时序路径成本。预设计上下文索引(ctxIdx)计算机制旨在为第二个MPS分支提供上下文模型。提出了零头检测器以根据UEGk编码特征来改善解码旁路引擎的性能。此外,为了降低内存访问的频率,我们在外部存储器中重新组织了上下文模型,并使用三个循环缓冲区分别缓存了上下文模型,相邻信息和位流。采用具有预测方案的预取机制将相应的内容加载到循环缓冲区中以隐藏外部存储器等待时间。实验结果表明,我们的设计可以在SMIC18硅技术中以20.71k的门数在250 MHz下工作,并且平均数据解码速率为1.5 bins / cycle。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号