Exploiting the image prior in CLIP for super-resolution

Super-resolution (SR) is a fundamental task in computer vision aimed at enhancing the resolution and quality of low-resolution images. However, a persistent challenge arises from the inherent ambiguity where a single low-resolution image may correspond to mul- tiple high-resolution images. Additiona...

Full description

Bibliographic Details
Main Author:	Chen, Xingyu
Other Authors:	Chen Change Loy
Format:	Final Year Project (FYP)
Language:	English
Published:	Nanyang Technological University 2024
Subjects:	Computer and Information Science Super resolution Computer vision CLIP Deep learning
Online Access:	https://hdl.handle.net/10356/175133

_version_	1826115365273337856
author	Chen, Xingyu
author2	Chen Change Loy
author_facet	Chen Change Loy Chen, Xingyu
author_sort	Chen, Xingyu
collection	NTU
description	Super-resolution (SR) is a fundamental task in computer vision aimed at enhancing the resolution and quality of low-resolution images. However, a persistent challenge arises from the inherent ambiguity where a single low-resolution image may correspond to mul- tiple high-resolution images. Additional priors are essential to address such problem, especially when the degradation is complex. Recent emergence of large vision-language model such as CLIP provides potential to enhance SR generation by providing extra con- textual information from the image. Hence, in this project, we investigate the efficacy of integrating CLIP priors into image super-resolution. Through a series of experiments, we explore both blind and non-blind SR problems, evaluating the impact of CLIP priors on model performance. Additionally, we analyze the limitations and challenges associated with CLIP integration, particularly in handling low-resolution and incomplete images. Our findings demonstrate that while CLIP priors hold promise in enhancing SR results, careful fine-tuning is required to optimize their utilization for image generation tasks.
first_indexed	2024-10-01T03:54:16Z
format	Final Year Project (FYP)
id	ntu-10356/175133
institution	Nanyang Technological University
language	English
last_indexed	2024-10-01T03:54:16Z
publishDate	2024
publisher	Nanyang Technological University
record_format	dspace
spelling	ntu-10356/1751332024-04-26T15:40:52Z Exploiting the image prior in CLIP for super-resolution Chen, Xingyu Chen Change Loy School of Computer Science and Engineering ccloy@ntu.edu.sg Computer and Information Science Super resolution Computer vision CLIP Deep learning Super-resolution (SR) is a fundamental task in computer vision aimed at enhancing the resolution and quality of low-resolution images. However, a persistent challenge arises from the inherent ambiguity where a single low-resolution image may correspond to mul- tiple high-resolution images. Additional priors are essential to address such problem, especially when the degradation is complex. Recent emergence of large vision-language model such as CLIP provides potential to enhance SR generation by providing extra con- textual information from the image. Hence, in this project, we investigate the efficacy of integrating CLIP priors into image super-resolution. Through a series of experiments, we explore both blind and non-blind SR problems, evaluating the impact of CLIP priors on model performance. Additionally, we analyze the limitations and challenges associated with CLIP integration, particularly in handling low-resolution and incomplete images. Our findings demonstrate that while CLIP priors hold promise in enhancing SR results, careful fine-tuning is required to optimize their utilization for image generation tasks. Bachelor's degree 2024-04-22T02:49:53Z 2024-04-22T02:49:53Z 2024 Final Year Project (FYP) Chen, X. (2024). Exploiting the image prior in CLIP for super-resolution. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175133 https://hdl.handle.net/10356/175133 en SCSE23-0477 application/pdf Nanyang Technological University
spellingShingle	Computer and Information Science Super resolution Computer vision CLIP Deep learning Chen, Xingyu Exploiting the image prior in CLIP for super-resolution
title	Exploiting the image prior in CLIP for super-resolution
title_full	Exploiting the image prior in CLIP for super-resolution
title_fullStr	Exploiting the image prior in CLIP for super-resolution
title_full_unstemmed	Exploiting the image prior in CLIP for super-resolution
title_short	Exploiting the image prior in CLIP for super-resolution
title_sort	exploiting the image prior in clip for super resolution
topic	Computer and Information Science Super resolution Computer vision CLIP Deep learning
url	https://hdl.handle.net/10356/175133
work_keys_str_mv	AT chenxingyu exploitingtheimagepriorinclipforsuperresolution

Exploiting the image prior in CLIP for super-resolution

Similar Items