VLSI Architecture for 8-Point AI-based Arai DCT having Low Area-Time Complexity and Power at Improved Accuracy

A low complexity digital VLSI architecture for the computation of an algebraic integer (AI) based 8-point Arai DCT algorithm is proposed. AI encoding schemes for exact representation of the Arai DCT transform based on a particularly sparse 2-D AI representation is reviewed, leading to the proposed n...

Full description

Bibliographic Details
Main Authors: Jithra Adikari, Renato J. Cintra, Arjuna Madanayake, Vassil S. Dimitrov, Amila Edirisuriya
Format: Article
Language:English
Published: MDPI AG 2012-03-01
Series:Journal of Low Power Electronics and Applications
Subjects:
Online Access:http://www.mdpi.com/2079-9268/2/2/127/
_version_ 1828359035228782592
author Jithra Adikari
Renato J. Cintra
Arjuna Madanayake
Vassil S. Dimitrov
Amila Edirisuriya
author_facet Jithra Adikari
Renato J. Cintra
Arjuna Madanayake
Vassil S. Dimitrov
Amila Edirisuriya
author_sort Jithra Adikari
collection DOAJ
description A low complexity digital VLSI architecture for the computation of an algebraic integer (AI) based 8-point Arai DCT algorithm is proposed. AI encoding schemes for exact representation of the Arai DCT transform based on a particularly sparse 2-D AI representation is reviewed, leading to the proposed novel architecture based on a new final reconstruction step (FRS) having lower complexity and higher accuracy compared to the state-of-the-art. This FRS is based on an optimization derived from expansion factors that leads to small integer constant-coefficient multiplications, which are realized with common sub-expression elimination (CSE) and Booth encoding. The reference circuit [1] as well as the proposed architectures for two expansion factors α† = 4.5958 and α′ = 167.2309 are implemented. The proposed circuits show 150% and 300% improvements in the number of DCT coefficients having error ≤ 0:1% compared to [1]. The three designs were realized using both 40 nm CMOS Xilinx Virtex-6 FPGAs and synthesized using 65 nm CMOS general purpose standard cells from TSMC. Post synthesis timing analysis of 65 nm CMOS realizations at 900 mV for all three designs of the 8-point DCT core for 8-bit inputs show potential real-time operation at 2.083 GHz clock frequency leading to a combined throughput of 2.083 billion 8-point Arai DCTs per second. The expansion-factor designs show a 43% reduction in area (A) and 29% reduction in dynamic power (PD) for FPGA realizations. An 11% reduction in area is observed for the ASIC design for α† = 4.5958 for an 8% reduction in total power (PT ). Our second ASIC design having α′ = 167.2309 shows marginal improvements in area and power compared to our reference design but at significantly better accuracy.
first_indexed 2024-04-14T03:36:53Z
format Article
id doaj.art-9d0cdf6a78624cab8cbf7fcc82489e6f
institution Directory Open Access Journal
issn 2079-9268
language English
last_indexed 2024-04-14T03:36:53Z
publishDate 2012-03-01
publisher MDPI AG
record_format Article
series Journal of Low Power Electronics and Applications
spelling doaj.art-9d0cdf6a78624cab8cbf7fcc82489e6f2022-12-22T02:14:43ZengMDPI AGJournal of Low Power Electronics and Applications2079-92682012-03-012212714210.3390/jlpea2020127VLSI Architecture for 8-Point AI-based Arai DCT having Low Area-Time Complexity and Power at Improved AccuracyJithra AdikariRenato J. CintraArjuna MadanayakeVassil S. DimitrovAmila EdirisuriyaA low complexity digital VLSI architecture for the computation of an algebraic integer (AI) based 8-point Arai DCT algorithm is proposed. AI encoding schemes for exact representation of the Arai DCT transform based on a particularly sparse 2-D AI representation is reviewed, leading to the proposed novel architecture based on a new final reconstruction step (FRS) having lower complexity and higher accuracy compared to the state-of-the-art. This FRS is based on an optimization derived from expansion factors that leads to small integer constant-coefficient multiplications, which are realized with common sub-expression elimination (CSE) and Booth encoding. The reference circuit [1] as well as the proposed architectures for two expansion factors α† = 4.5958 and α′ = 167.2309 are implemented. The proposed circuits show 150% and 300% improvements in the number of DCT coefficients having error ≤ 0:1% compared to [1]. The three designs were realized using both 40 nm CMOS Xilinx Virtex-6 FPGAs and synthesized using 65 nm CMOS general purpose standard cells from TSMC. Post synthesis timing analysis of 65 nm CMOS realizations at 900 mV for all three designs of the 8-point DCT core for 8-bit inputs show potential real-time operation at 2.083 GHz clock frequency leading to a combined throughput of 2.083 billion 8-point Arai DCTs per second. The expansion-factor designs show a 43% reduction in area (A) and 29% reduction in dynamic power (PD) for FPGA realizations. An 11% reduction in area is observed for the ASIC design for α† = 4.5958 for an 8% reduction in total power (PT ). Our second ASIC design having α′ = 167.2309 shows marginal improvements in area and power compared to our reference design but at significantly better accuracy.http://www.mdpi.com/2079-9268/2/2/127/video processingalgebraic integer quantizationDCTcompression
spellingShingle Jithra Adikari
Renato J. Cintra
Arjuna Madanayake
Vassil S. Dimitrov
Amila Edirisuriya
VLSI Architecture for 8-Point AI-based Arai DCT having Low Area-Time Complexity and Power at Improved Accuracy
Journal of Low Power Electronics and Applications
video processing
algebraic integer quantization
DCT
compression
title VLSI Architecture for 8-Point AI-based Arai DCT having Low Area-Time Complexity and Power at Improved Accuracy
title_full VLSI Architecture for 8-Point AI-based Arai DCT having Low Area-Time Complexity and Power at Improved Accuracy
title_fullStr VLSI Architecture for 8-Point AI-based Arai DCT having Low Area-Time Complexity and Power at Improved Accuracy
title_full_unstemmed VLSI Architecture for 8-Point AI-based Arai DCT having Low Area-Time Complexity and Power at Improved Accuracy
title_short VLSI Architecture for 8-Point AI-based Arai DCT having Low Area-Time Complexity and Power at Improved Accuracy
title_sort vlsi architecture for 8 point ai based arai dct having low area time complexity and power at improved accuracy
topic video processing
algebraic integer quantization
DCT
compression
url http://www.mdpi.com/2079-9268/2/2/127/
work_keys_str_mv AT jithraadikari vlsiarchitecturefor8pointaibasedaraidcthavinglowareatimecomplexityandpoweratimprovedaccuracy
AT renatojcintra vlsiarchitecturefor8pointaibasedaraidcthavinglowareatimecomplexityandpoweratimprovedaccuracy
AT arjunamadanayake vlsiarchitecturefor8pointaibasedaraidcthavinglowareatimecomplexityandpoweratimprovedaccuracy
AT vassilsdimitrov vlsiarchitecturefor8pointaibasedaraidcthavinglowareatimecomplexityandpoweratimprovedaccuracy
AT amilaedirisuriya vlsiarchitecturefor8pointaibasedaraidcthavinglowareatimecomplexityandpoweratimprovedaccuracy