Visual recognition using artificial intelligence (image inpainting with transformers)

As a major task in computer vision, image inpainting is the process of filling in the missing parts of an image. Traditional image inpainting methods often struggle with complex or large missing regions. In the last decade, deep learning methods have made significant progress in...

Bibliographic Details
Main Author: Dou, Yuxiao
Other Authors: Yap Kim Hui
Format: Final Year Project (FYP)
Language: English
Published: Nanyang Technological University 2023
Subjects: Engineering::Electrical and electronic engineering
Online Access: https://hdl.handle.net/10356/167299
_version_ 1826122373707857920
author Dou, Yuxiao
author2 Yap Kim Hui
author_facet Yap Kim Hui
Dou, Yuxiao
author_sort Dou, Yuxiao
collection NTU
description As a major task in computer vision, image inpainting is the process of filling in the missing parts of an image. Traditional image inpainting methods often struggle with complex or large missing regions. In the last decade, deep learning methods have made significant progress in this area. In this project, the potential of transformers in image inpainting is explored. Transformers have already demonstrated an outstanding ability to model global structure in NLP, which could be useful in image inpainting as well. However, the computational inefficiency of transformers is magnified when dealing with image data. To overcome this weakness, a method combining transformers and CNNs is explored. We achieve high performance on the Places2 and FFHQ datasets. Since the FFHQ dataset contains a limited number of Asian-face images, an Asian-face dataset (AFD-dataset) was also used to extend the application of the proposed method. To conclude, this project further explores the potential of transformers in image inpainting and provides useful data and information for future research.
first_indexed 2024-10-01T05:47:26Z
format Final Year Project (FYP)
id ntu-10356/167299
institution Nanyang Technological University
language English
last_indexed 2024-10-01T05:47:26Z
publishDate 2023
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/167299 2023-07-07T15:47:11Z Visual recognition using artificial intelligence (image inpainting with transformers) Dou, Yuxiao Yap Kim Hui School of Electrical and Electronic Engineering EKHYap@ntu.edu.sg Engineering::Electrical and electronic engineering Bachelor of Engineering (Electrical and Electronic Engineering) 2023-05-25T07:59:37Z 2023-05-25T07:59:37Z 2023 Final Year Project (FYP) Dou, Y. (2023). Visual recognition using artificial intelligence (image inpainting with transformers). Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/167299 en application/pdf Nanyang Technological University
spellingShingle Engineering::Electrical and electronic engineering
Dou, Yuxiao
Visual recognition using artificial intelligence (image inpainting with transformers)
title Visual recognition using artificial intelligence (image inpainting with transformers)
topic Engineering::Electrical and electronic engineering
url https://hdl.handle.net/10356/167299
work_keys_str_mv AT douyuxiao visualrecognitionusingartificialintelligenceimageinpaintingwithtransformers