Joint Algorithm-Architecture Optimization of CABAC

This paper uses joint algorithm and architecture design to enable high coding efficiency in conjunction with high processing speed and low area cost. Specifically, it presents several optimizations that can be performed on Context Adaptive Binary Arithmetic Coding (CABAC), a form of entropy coding u...

Full description

Bibliographic Details
Main Authors:	Sze, Vivienne, Chandrakasan, Anantha P.
Other Authors:	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format:	Article
Language:	en_US
Published:	Springer-Verlag 2015
Online Access:	http://hdl.handle.net/1721.1/92827 https://orcid.org/0000-0002-5977-2748 https://orcid.org/0000-0003-4841-3990

_version_	1826206984242724864
author	Sze, Vivienne Chandrakasan, Anantha P.
author2	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
author_facet	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Sze, Vivienne Chandrakasan, Anantha P.
author_sort	Sze, Vivienne
collection	MIT
description	This paper uses joint algorithm and architecture design to enable high coding efficiency in conjunction with high processing speed and low area cost. Specifically, it presents several optimizations that can be performed on Context Adaptive Binary Arithmetic Coding (CABAC), a form of entropy coding used in H.264/AVC, to achieve the throughput necessary for real-time low power high definition video coding. The combination of syntax element partitions and interleaved entropy slices, referred to as Massively Parallel CABAC, increases the number of binary symbols that can be processed in a cycle. Subinterval reordering is used to reduce the cycle time required to process each binary symbol. Under common conditions using the JM12.0 software, the Massively Parallel CABAC, increases the bins per cycle by 2.7 to 32.8× at a cost of 0.25 to 6.84% coding loss compared with sequential single slice H.264/AVC CABAC. It also provides a 2× reduction in area cost, and reduces memory bandwidth. Subinterval reordering reduces the critical path delay by 14 to 22%, while modifications to context selection reduces the memory requirement by 67%. This work demonstrates that accounting for implementation cost during video coding algorithms design can enable higher processing speed and reduce hardware cost, while still delivering high coding efficiency in the next generation video coding standard.
first_indexed	2024-09-23T13:41:58Z
format	Article
id	mit-1721.1/92827
institution	Massachusetts Institute of Technology
language	en_US
last_indexed	2024-09-23T13:41:58Z
publishDate	2015
publisher	Springer-Verlag
record_format	dspace
spelling	mit-1721.1/928272022-09-28T15:35:37Z Joint Algorithm-Architecture Optimization of CABAC Sze, Vivienne Chandrakasan, Anantha P. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Sze, Vivienne Sze, Vivienne Chandrakasan, Anantha P. This paper uses joint algorithm and architecture design to enable high coding efficiency in conjunction with high processing speed and low area cost. Specifically, it presents several optimizations that can be performed on Context Adaptive Binary Arithmetic Coding (CABAC), a form of entropy coding used in H.264/AVC, to achieve the throughput necessary for real-time low power high definition video coding. The combination of syntax element partitions and interleaved entropy slices, referred to as Massively Parallel CABAC, increases the number of binary symbols that can be processed in a cycle. Subinterval reordering is used to reduce the cycle time required to process each binary symbol. Under common conditions using the JM12.0 software, the Massively Parallel CABAC, increases the bins per cycle by 2.7 to 32.8× at a cost of 0.25 to 6.84% coding loss compared with sequential single slice H.264/AVC CABAC. It also provides a 2× reduction in area cost, and reduces memory bandwidth. Subinterval reordering reduces the critical path delay by 14 to 22%, while modifications to context selection reduces the memory requirement by 67%. This work demonstrates that accounting for implementation cost during video coding algorithms design can enable higher processing speed and reduce hardware cost, while still delivering high coding efficiency in the next generation video coding standard. Texas Instruments Incorporated (Graduate Women's Fellowship for Leadership in Microelectronics) Natural Sciences and Engineering Research Council of Canada 2015-01-13T18:23:52Z 2015-01-13T18:23:52Z 2012-05 2012-03 Article http://purl.org/eprint/type/JournalArticle 1939-8018 1939-8115 http://hdl.handle.net/1721.1/92827 Sze, Vivienne, and Anantha P. Chandrakasan. “Joint Algorithm-Architecture Optimization of CABAC.” Journal of Signal Processing Systems 69, no. 3 (May 26, 2012): 239–252. https://orcid.org/0000-0002-5977-2748 https://orcid.org/0000-0003-4841-3990 en_US http://dx.doi.org/10.1007/s11265-012-0678-2 Journal of Signal Processing Systems Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf Springer-Verlag Sze
spellingShingle	Sze, Vivienne Chandrakasan, Anantha P. Joint Algorithm-Architecture Optimization of CABAC
title	Joint Algorithm-Architecture Optimization of CABAC
title_full	Joint Algorithm-Architecture Optimization of CABAC
title_fullStr	Joint Algorithm-Architecture Optimization of CABAC
title_full_unstemmed	Joint Algorithm-Architecture Optimization of CABAC
title_short	Joint Algorithm-Architecture Optimization of CABAC
title_sort	joint algorithm architecture optimization of cabac
url	http://hdl.handle.net/1721.1/92827 https://orcid.org/0000-0002-5977-2748 https://orcid.org/0000-0003-4841-3990
work_keys_str_mv	AT szevivienne jointalgorithmarchitectureoptimizationofcabac AT chandrakasanananthap jointalgorithmarchitectureoptimizationofcabac

Joint Algorithm-Architecture Optimization of CABAC

Similar Items