Advanced audio coding with efficient psychoacoustic modeling

Data compression is an essential task for audio systems, which must not only handle enormous amounts of data but also preserve high-quality resolution. The Moving Picture Experts Group (MPEG) audio standards are a powerful family of audio compression techniques. It can signific...


Bibliographic Details
Main Author: Sudeendra Maddur Gundurao.
Other Authors: Tan Yap Peng
Format: Thesis
Language: English
Published: 2009
Subjects:
Online Access: http://hdl.handle.net/10356/18785
collection NTU
description Data compression is an essential task for audio systems, which must not only handle enormous amounts of data but also preserve high-quality resolution. The Moving Picture Experts Group (MPEG) audio standards are a powerful family of audio compression techniques. They can significantly reduce transmission bandwidth and data storage requirements while introducing little distortion. This dissertation presents a new low-complexity design of Psycho-Acoustic Model-2 (PAM), which is the key technology for low-power MPEG-2/4 Advanced Audio Coding (AAC) encoding. The real-time constraint of MPEG AAC creates a heavy computational bottleneck on today's portable devices. To overcome this problem, the design of the PAM is analyzed and optimized. At the algorithmic level, a new Modified Discrete Cosine Transform-based (MDCT-based) PAM is designed and implemented, aiming at a major reduction in complexity while also improving the quality of the coded audio. In addition, the calculation of the spreading function is replaced with look-up tables. The computational complexity of the proposed single-transform new MDCT-based PAM (Model-C) is reduced by more than 85% compared with the classical FFT-based PAM (Model-A) and by around 40% compared with the dual-transform MDCT-based PAM (Model-B) suggested in [33]. The proposed design makes it possible to implement the computationally intensive classical MPEG-2/4 AAC stereo encoder in real time by sufficiently reducing its complexity.
first_indexed 2024-10-01T05:23:54Z
format Thesis
id ntu-10356/18785
institution Nanyang Technological University
language English
last_indexed 2024-10-01T05:23:54Z
publishDate 2009
record_format dspace
spelling ntu-10356/18785 2023-07-04T15:41:58Z Advanced audio coding with efficient psychoacoustic modeling Sudeendra Maddur Gundurao. Tan Yap Peng School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Master of Science (Signal Processing) 2009-07-17T08:50:15Z 2009-07-17T08:50:15Z 2008 2008 Thesis http://hdl.handle.net/10356/18785 en 72 p. application/pdf
title Advanced audio coding with efficient psychoacoustic modeling
topic DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
url http://hdl.handle.net/10356/18785