Loop unroll optimization for GPU implementation

This report presents the process of implementation and optimization of two image resize algorithms namely, Bilinear and Bicubic Interpolation. The purpose of the optimization seeks to improve execution time and is primarily done with the use of Nvidia’s Compute Unified Device Architecture (CUDA). Bo...

Full description

Bibliographic Details
Main Author: Wu, Jianghua.
Other Authors: School of Computer Engineering
Format: Final Year Project (FYP)
Language:English
Published: 2012
Subjects:
Online Access:http://hdl.handle.net/10356/48561