Lightweight generative adversarial networks for text-guided image manipulation

We propose a novel lightweight generative adversarial network for efficient image manipulation using natural language descriptions. To achieve this, a new word-level discriminator is proposed, which provides the generator with fine-grained training feedback at word-level, to facilitate training a li...

Täydet tiedot

Bibliografiset tiedot
Päätekijät: Li, B, Qi, X, Torr, PHS, Lukasiewicz, T
Aineistotyyppi: Conference item
Kieli:English
Julkaistu: NeurIPS 2020
_version_ 1826288053268774912
author Li, B
Qi, X
Torr, PHS
Lukasiewicz, T
author_facet Li, B
Qi, X
Torr, PHS
Lukasiewicz, T
author_sort Li, B
collection OXFORD
description We propose a novel lightweight generative adversarial network for efficient image manipulation using natural language descriptions. To achieve this, a new word-level discriminator is proposed, which provides the generator with fine-grained training feedback at word-level, to facilitate training a lightweight generator that has a small number of parameters, but can still correctly focus on specific visual attributes of an image, and then edit them without affecting other contents that are not described in the text. Furthermore, thanks to the explicit training signal related to each word, the discriminator can also be simplified to have a lightweight structure. Compared with the state of the art, our method has a much smaller number of parameters, but still achieves a competitive manipulation performance. Extensive experimental results demonstrate that our method can better disentangle different visual attributes, then correctly map them to corresponding semantic words, and thus achieve a more accurate image modification using natural language descriptions.
first_indexed 2024-03-07T02:07:55Z
format Conference item
id oxford-uuid:9f9f00eb-4f7c-4e02-ac38-54348f05fde5
institution University of Oxford
language English
last_indexed 2024-03-07T02:07:55Z
publishDate 2020
publisher NeurIPS
record_format dspace
spelling oxford-uuid:9f9f00eb-4f7c-4e02-ac38-54348f05fde52022-03-27T00:59:16ZLightweight generative adversarial networks for text-guided image manipulationConference itemhttp://purl.org/coar/resource_type/c_5794uuid:9f9f00eb-4f7c-4e02-ac38-54348f05fde5EnglishSymplectic ElementsNeurIPS2020Li, BQi, XTorr, PHSLukasiewicz, TWe propose a novel lightweight generative adversarial network for efficient image manipulation using natural language descriptions. To achieve this, a new word-level discriminator is proposed, which provides the generator with fine-grained training feedback at word-level, to facilitate training a lightweight generator that has a small number of parameters, but can still correctly focus on specific visual attributes of an image, and then edit them without affecting other contents that are not described in the text. Furthermore, thanks to the explicit training signal related to each word, the discriminator can also be simplified to have a lightweight structure. Compared with the state of the art, our method has a much smaller number of parameters, but still achieves a competitive manipulation performance. Extensive experimental results demonstrate that our method can better disentangle different visual attributes, then correctly map them to corresponding semantic words, and thus achieve a more accurate image modification using natural language descriptions.
spellingShingle Li, B
Qi, X
Torr, PHS
Lukasiewicz, T
Lightweight generative adversarial networks for text-guided image manipulation
title Lightweight generative adversarial networks for text-guided image manipulation
title_full Lightweight generative adversarial networks for text-guided image manipulation
title_fullStr Lightweight generative adversarial networks for text-guided image manipulation
title_full_unstemmed Lightweight generative adversarial networks for text-guided image manipulation
title_short Lightweight generative adversarial networks for text-guided image manipulation
title_sort lightweight generative adversarial networks for text guided image manipulation
work_keys_str_mv AT lib lightweightgenerativeadversarialnetworksfortextguidedimagemanipulation
AT qix lightweightgenerativeadversarialnetworksfortextguidedimagemanipulation
AT torrphs lightweightgenerativeadversarialnetworksfortextguidedimagemanipulation
AT lukasiewiczt lightweightgenerativeadversarialnetworksfortextguidedimagemanipulation