Joint Algorithm-Architecture Optimization of CABAC

This paper uses joint algorithm and architecture design to enable high coding efficiency in conjunction with high processing speed and low area cost. Specifically, it presents several optimizations that can be performed on Context Adaptive Binary Arithmetic Coding (CABAC), a form of entropy coding u...

Full description

Bibliographic Details
Main Authors: Sze, Vivienne, Chandrakasan, Anantha P.
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format: Article
Language:en_US
Published: Springer-Verlag 2015
Online Access:http://hdl.handle.net/1721.1/92827
https://orcid.org/0000-0002-5977-2748
https://orcid.org/0000-0003-4841-3990
_version_ 1826206984242724864
author Sze, Vivienne
Chandrakasan, Anantha P.
author2 Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
author_facet Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Sze, Vivienne
Chandrakasan, Anantha P.
author_sort Sze, Vivienne
collection MIT
description This paper uses joint algorithm and architecture design to enable high coding efficiency in conjunction with high processing speed and low area cost. Specifically, it presents several optimizations that can be performed on Context Adaptive Binary Arithmetic Coding (CABAC), a form of entropy coding used in H.264/AVC, to achieve the throughput necessary for real-time low power high definition video coding. The combination of syntax element partitions and interleaved entropy slices, referred to as Massively Parallel CABAC, increases the number of binary symbols that can be processed in a cycle. Subinterval reordering is used to reduce the cycle time required to process each binary symbol. Under common conditions using the JM12.0 software, the Massively Parallel CABAC, increases the bins per cycle by 2.7 to 32.8× at a cost of 0.25 to 6.84% coding loss compared with sequential single slice H.264/AVC CABAC. It also provides a 2× reduction in area cost, and reduces memory bandwidth. Subinterval reordering reduces the critical path delay by 14 to 22%, while modifications to context selection reduces the memory requirement by 67%. This work demonstrates that accounting for implementation cost during video coding algorithms design can enable higher processing speed and reduce hardware cost, while still delivering high coding efficiency in the next generation video coding standard.
first_indexed 2024-09-23T13:41:58Z
format Article
id mit-1721.1/92827
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T13:41:58Z
publishDate 2015
publisher Springer-Verlag
record_format dspace
spelling mit-1721.1/928272022-09-28T15:35:37Z Joint Algorithm-Architecture Optimization of CABAC Sze, Vivienne Chandrakasan, Anantha P. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Sze, Vivienne Sze, Vivienne Chandrakasan, Anantha P. This paper uses joint algorithm and architecture design to enable high coding efficiency in conjunction with high processing speed and low area cost. Specifically, it presents several optimizations that can be performed on Context Adaptive Binary Arithmetic Coding (CABAC), a form of entropy coding used in H.264/AVC, to achieve the throughput necessary for real-time low power high definition video coding. The combination of syntax element partitions and interleaved entropy slices, referred to as Massively Parallel CABAC, increases the number of binary symbols that can be processed in a cycle. Subinterval reordering is used to reduce the cycle time required to process each binary symbol. Under common conditions using the JM12.0 software, the Massively Parallel CABAC, increases the bins per cycle by 2.7 to 32.8× at a cost of 0.25 to 6.84% coding loss compared with sequential single slice H.264/AVC CABAC. It also provides a 2× reduction in area cost, and reduces memory bandwidth. Subinterval reordering reduces the critical path delay by 14 to 22%, while modifications to context selection reduces the memory requirement by 67%. This work demonstrates that accounting for implementation cost during video coding algorithms design can enable higher processing speed and reduce hardware cost, while still delivering high coding efficiency in the next generation video coding standard. Texas Instruments Incorporated (Graduate Women's Fellowship for Leadership in Microelectronics) Natural Sciences and Engineering Research Council of Canada 2015-01-13T18:23:52Z 2015-01-13T18:23:52Z 2012-05 2012-03 Article http://purl.org/eprint/type/JournalArticle 1939-8018 1939-8115 http://hdl.handle.net/1721.1/92827 Sze, Vivienne, and Anantha P. Chandrakasan. “Joint Algorithm-Architecture Optimization of CABAC.” Journal of Signal Processing Systems 69, no. 3 (May 26, 2012): 239–252. https://orcid.org/0000-0002-5977-2748 https://orcid.org/0000-0003-4841-3990 en_US http://dx.doi.org/10.1007/s11265-012-0678-2 Journal of Signal Processing Systems Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf Springer-Verlag Sze
spellingShingle Sze, Vivienne
Chandrakasan, Anantha P.
Joint Algorithm-Architecture Optimization of CABAC
title Joint Algorithm-Architecture Optimization of CABAC
title_full Joint Algorithm-Architecture Optimization of CABAC
title_fullStr Joint Algorithm-Architecture Optimization of CABAC
title_full_unstemmed Joint Algorithm-Architecture Optimization of CABAC
title_short Joint Algorithm-Architecture Optimization of CABAC
title_sort joint algorithm architecture optimization of cabac
url http://hdl.handle.net/1721.1/92827
https://orcid.org/0000-0002-5977-2748
https://orcid.org/0000-0003-4841-3990
work_keys_str_mv AT szevivienne jointalgorithmarchitectureoptimizationofcabac
AT chandrakasanananthap jointalgorithmarchitectureoptimizationofcabac