Pseudo-Supervised Learning for Semantic Multi-Style Transfer

Numerous methods for style transfer have been developed using unsupervised learning and gained impressive results. However, optimal style transfer cannot be conducted from a global fashion in certain style domains, mainly when a single target-style domain contains semantic objects that have their ow...

Full description

Bibliographic Details
Main Authors: Saehun Kim, Jeonghyeok Do, Munchurl Kim
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9316188/
_version_ 1818479881764208640
author Saehun Kim
Jeonghyeok Do
Munchurl Kim
author_facet Saehun Kim
Jeonghyeok Do
Munchurl Kim
author_sort Saehun Kim
collection DOAJ
description Numerous methods for style transfer have been developed using unsupervised learning and gained impressive results. However, optimal style transfer cannot be conducted from a global fashion in certain style domains, mainly when a single target-style domain contains semantic objects that have their own distinct and unique styles, e.g., those objects in the anime-style domain. Previous methods are incongruent because the unsupervised learning can not provide the semantic mappings between the multi-style objects according to their unique styles. Thus, in this paper, we propose a pseudo-supervised learning framework for the semantic multi-style transfer (SMST), which consists of (i) a pseudo ground truth (pGT) generation phase and (ii) a SMST learning phase. In the pGT generation phase, multiple semantic objects of the photo images are separately transferred to the target-domain object styles in an object-oriented fashion. Then the transferred objects are composed back to an image, which is the pGT. In the SMST learning phase, a SMST network (SMSTnet) is trained with the pairs of the photo images and its respective pGT in a supervised manner. From this, our framework can provide the semantic mappings of multi-style objects. Moreover, to embrace the multi-styles of various objects into a single generator, we design the SMSTnet with channel attentions in conjunction with a discriminator dedicated to our pseudo-supervised learning. Our method has been applied and intensively tested for anime-style transfer learning. The experimental results demonstrate the effectiveness of our method and show its superiority compared to the state-of-the-art methods.
first_indexed 2024-12-10T11:15:58Z
format Article
id doaj.art-03dbd0bc6f394002b5e8e3e924d22bf6
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-10T11:15:58Z
publishDate 2021-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-03dbd0bc6f394002b5e8e3e924d22bf62022-12-22T01:51:10ZengIEEEIEEE Access2169-35362021-01-0197930794210.1109/ACCESS.2021.30496379316188Pseudo-Supervised Learning for Semantic Multi-Style TransferSaehun Kim0https://orcid.org/0000-0002-7250-2654Jeonghyeok Do1https://orcid.org/0000-0003-0030-0129Munchurl Kim2https://orcid.org/0000-0003-0146-5419School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South KoreaSchool of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South KoreaSchool of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South KoreaNumerous methods for style transfer have been developed using unsupervised learning and gained impressive results. However, optimal style transfer cannot be conducted from a global fashion in certain style domains, mainly when a single target-style domain contains semantic objects that have their own distinct and unique styles, e.g., those objects in the anime-style domain. Previous methods are incongruent because the unsupervised learning can not provide the semantic mappings between the multi-style objects according to their unique styles. Thus, in this paper, we propose a pseudo-supervised learning framework for the semantic multi-style transfer (SMST), which consists of (i) a pseudo ground truth (pGT) generation phase and (ii) a SMST learning phase. In the pGT generation phase, multiple semantic objects of the photo images are separately transferred to the target-domain object styles in an object-oriented fashion. Then the transferred objects are composed back to an image, which is the pGT. In the SMST learning phase, a SMST network (SMSTnet) is trained with the pairs of the photo images and its respective pGT in a supervised manner. From this, our framework can provide the semantic mappings of multi-style objects. Moreover, to embrace the multi-styles of various objects into a single generator, we design the SMSTnet with channel attentions in conjunction with a discriminator dedicated to our pseudo-supervised learning. Our method has been applied and intensively tested for anime-style transfer learning. The experimental results demonstrate the effectiveness of our method and show its superiority compared to the state-of-the-art methods.https://ieeexplore.ieee.org/document/9316188/Style transferimage-to-image translationgenerative adversarial networks
spellingShingle Saehun Kim
Jeonghyeok Do
Munchurl Kim
Pseudo-Supervised Learning for Semantic Multi-Style Transfer
IEEE Access
Style transfer
image-to-image translation
generative adversarial networks
title Pseudo-Supervised Learning for Semantic Multi-Style Transfer
title_full Pseudo-Supervised Learning for Semantic Multi-Style Transfer
title_fullStr Pseudo-Supervised Learning for Semantic Multi-Style Transfer
title_full_unstemmed Pseudo-Supervised Learning for Semantic Multi-Style Transfer
title_short Pseudo-Supervised Learning for Semantic Multi-Style Transfer
title_sort pseudo supervised learning for semantic multi style transfer
topic Style transfer
image-to-image translation
generative adversarial networks
url https://ieeexplore.ieee.org/document/9316188/
work_keys_str_mv AT saehunkim pseudosupervisedlearningforsemanticmultistyletransfer
AT jeonghyeokdo pseudosupervisedlearningforsemanticmultistyletransfer
AT munchurlkim pseudosupervisedlearningforsemanticmultistyletransfer