Autoencoder-based conditional optimal transport generative adversarial network for medical image generation

Medical image generation has recently garnered significant interest among researchers. However, the primary generative models, such as Generative Adversarial Networks (GANs), often encounter challenges during training, including mode collapse. To address these issues, we proposed the AE-COT-GAN mode...

Full description

Bibliographic Details
Main Authors:	Jun Wang, Bohan Lei, Liya Ding, Xiaoyin Xu, Xianfeng Gu, Min Zhang
Format:	Article
Language:	English
Published:	Elsevier 2024-03-01
Series:	Visual Informatics
Subjects:	Medical image generation Mode collapse Mode mixing Optimal transport Generative adversarial networks
Online Access:	http://www.sciencedirect.com/science/article/pii/S2468502X23000529

_version_	1827295923038846976
author	Jun Wang Bohan Lei Liya Ding Xiaoyin Xu Xianfeng Gu Min Zhang
author_facet	Jun Wang Bohan Lei Liya Ding Xiaoyin Xu Xianfeng Gu Min Zhang
author_sort	Jun Wang
collection	DOAJ
description	Medical image generation has recently garnered significant interest among researchers. However, the primary generative models, such as Generative Adversarial Networks (GANs), often encounter challenges during training, including mode collapse. To address these issues, we proposed the AE-COT-GAN model (Autoencoder-based Conditional Optimal Transport Generative Adversarial Network) for the generation of medical images belonging to specific categories. The training process of our model comprises three fundamental components. The training process of our model encompasses three fundamental components. First, we employ an autoencoder model to obtain a low-dimensional manifold representation of real images. Second, we apply extended semi-discrete optimal transport to map Gaussian noise distribution to the latent space distribution and obtain corresponding labels effectively. This procedure leads to the generation of new latent codes with known labels. Finally, we integrate a GAN to train the decoder further to generate medical images. To evaluate the performance of the AE-COT-GAN model, we conducted experiments on two medical image datasets, namely DermaMNIST and BloodMNIST. The model’s performance was compared with state-of-the-art generative models. Results show that the AE-COT-GAN model had excellent performance in generating medical images. Moreover, it effectively addressed the common issues associated with traditional GANs.
first_indexed	2024-04-24T14:28:24Z
format	Article
id	doaj.art-e3dbfe4ea80a409a8d56ed198930b192
institution	Directory Open Access Journal
issn	2468-502X
language	English
last_indexed	2024-04-24T14:28:24Z
publishDate	2024-03-01
publisher	Elsevier
record_format	Article
series	Visual Informatics
spelling	doaj.art-e3dbfe4ea80a409a8d56ed198930b1922024-04-03T04:27:09ZengElsevierVisual Informatics2468-502X2024-03-01811525Autoencoder-based conditional optimal transport generative adversarial network for medical image generationJun Wang0Bohan Lei1Liya Ding2Xiaoyin Xu3Xianfeng Gu4Min Zhang5School of Software Technology, Zhejiang University, Ningbo, Zhejiang, ChinaCollege of Computer Science & Technology, Zhejiang University, Hangzhou, Zhejiang, ChinaDepartment of Pathology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, ChinaCollege of Biomedical Engineering & Instrument Science, Zhejiang University, Hangzhou, Zhejiang, ChinaDepartment of Computer Science, State University of New York at Stony Brook, Stony Brook, NY, USACollege of Computer Science & Technology, Zhejiang University, Hangzhou, Zhejiang, China; Corresponding author.Medical image generation has recently garnered significant interest among researchers. However, the primary generative models, such as Generative Adversarial Networks (GANs), often encounter challenges during training, including mode collapse. To address these issues, we proposed the AE-COT-GAN model (Autoencoder-based Conditional Optimal Transport Generative Adversarial Network) for the generation of medical images belonging to specific categories. The training process of our model comprises three fundamental components. The training process of our model encompasses three fundamental components. First, we employ an autoencoder model to obtain a low-dimensional manifold representation of real images. Second, we apply extended semi-discrete optimal transport to map Gaussian noise distribution to the latent space distribution and obtain corresponding labels effectively. This procedure leads to the generation of new latent codes with known labels. Finally, we integrate a GAN to train the decoder further to generate medical images. To evaluate the performance of the AE-COT-GAN model, we conducted experiments on two medical image datasets, namely DermaMNIST and BloodMNIST. The model’s performance was compared with state-of-the-art generative models. Results show that the AE-COT-GAN model had excellent performance in generating medical images. Moreover, it effectively addressed the common issues associated with traditional GANs.http://www.sciencedirect.com/science/article/pii/S2468502X23000529Medical image generationMode collapseMode mixingOptimal transportGenerative adversarial networks
spellingShingle	Jun Wang Bohan Lei Liya Ding Xiaoyin Xu Xianfeng Gu Min Zhang Autoencoder-based conditional optimal transport generative adversarial network for medical image generation Visual Informatics Medical image generation Mode collapse Mode mixing Optimal transport Generative adversarial networks
title	Autoencoder-based conditional optimal transport generative adversarial network for medical image generation
title_full	Autoencoder-based conditional optimal transport generative adversarial network for medical image generation
title_fullStr	Autoencoder-based conditional optimal transport generative adversarial network for medical image generation
title_full_unstemmed	Autoencoder-based conditional optimal transport generative adversarial network for medical image generation
title_short	Autoencoder-based conditional optimal transport generative adversarial network for medical image generation
title_sort	autoencoder based conditional optimal transport generative adversarial network for medical image generation
topic	Medical image generation Mode collapse Mode mixing Optimal transport Generative adversarial networks
url	http://www.sciencedirect.com/science/article/pii/S2468502X23000529
work_keys_str_mv	AT junwang autoencoderbasedconditionaloptimaltransportgenerativeadversarialnetworkformedicalimagegeneration AT bohanlei autoencoderbasedconditionaloptimaltransportgenerativeadversarialnetworkformedicalimagegeneration AT liyading autoencoderbasedconditionaloptimaltransportgenerativeadversarialnetworkformedicalimagegeneration AT xiaoyinxu autoencoderbasedconditionaloptimaltransportgenerativeadversarialnetworkformedicalimagegeneration AT xianfenggu autoencoderbasedconditionaloptimaltransportgenerativeadversarialnetworkformedicalimagegeneration AT minzhang autoencoderbasedconditionaloptimaltransportgenerativeadversarialnetworkformedicalimagegeneration

Autoencoder-based conditional optimal transport generative adversarial network for medical image generation

Similar Items