Autoencoder-based conditional optimal transport generative adversarial network for medical image generation

Medical image generation has recently garnered significant interest among researchers. However, the primary generative models, such as Generative Adversarial Networks (GANs), often encounter challenges during training, including mode collapse. To address these issues, we proposed the AE-COT-GAN mode...

Full description

Bibliographic Details
Main Authors: Jun Wang, Bohan Lei, Liya Ding, Xiaoyin Xu, Xianfeng Gu, Min Zhang
Format: Article
Language:English
Published: Elsevier 2024-03-01
Series:Visual Informatics
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2468502X23000529
_version_ 1827295923038846976
author Jun Wang
Bohan Lei
Liya Ding
Xiaoyin Xu
Xianfeng Gu
Min Zhang
author_facet Jun Wang
Bohan Lei
Liya Ding
Xiaoyin Xu
Xianfeng Gu
Min Zhang
author_sort Jun Wang
collection DOAJ
description Medical image generation has recently garnered significant interest among researchers. However, the primary generative models, such as Generative Adversarial Networks (GANs), often encounter challenges during training, including mode collapse. To address these issues, we proposed the AE-COT-GAN model (Autoencoder-based Conditional Optimal Transport Generative Adversarial Network) for the generation of medical images belonging to specific categories. The training process of our model comprises three fundamental components. The training process of our model encompasses three fundamental components. First, we employ an autoencoder model to obtain a low-dimensional manifold representation of real images. Second, we apply extended semi-discrete optimal transport to map Gaussian noise distribution to the latent space distribution and obtain corresponding labels effectively. This procedure leads to the generation of new latent codes with known labels. Finally, we integrate a GAN to train the decoder further to generate medical images. To evaluate the performance of the AE-COT-GAN model, we conducted experiments on two medical image datasets, namely DermaMNIST and BloodMNIST. The model’s performance was compared with state-of-the-art generative models. Results show that the AE-COT-GAN model had excellent performance in generating medical images. Moreover, it effectively addressed the common issues associated with traditional GANs.
first_indexed 2024-04-24T14:28:24Z
format Article
id doaj.art-e3dbfe4ea80a409a8d56ed198930b192
institution Directory Open Access Journal
issn 2468-502X
language English
last_indexed 2024-04-24T14:28:24Z
publishDate 2024-03-01
publisher Elsevier
record_format Article
series Visual Informatics
spelling doaj.art-e3dbfe4ea80a409a8d56ed198930b1922024-04-03T04:27:09ZengElsevierVisual Informatics2468-502X2024-03-01811525Autoencoder-based conditional optimal transport generative adversarial network for medical image generationJun Wang0Bohan Lei1Liya Ding2Xiaoyin Xu3Xianfeng Gu4Min Zhang5School of Software Technology, Zhejiang University, Ningbo, Zhejiang, ChinaCollege of Computer Science & Technology, Zhejiang University, Hangzhou, Zhejiang, ChinaDepartment of Pathology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, ChinaCollege of Biomedical Engineering & Instrument Science, Zhejiang University, Hangzhou, Zhejiang, ChinaDepartment of Computer Science, State University of New York at Stony Brook, Stony Brook, NY, USACollege of Computer Science & Technology, Zhejiang University, Hangzhou, Zhejiang, China; Corresponding author.Medical image generation has recently garnered significant interest among researchers. However, the primary generative models, such as Generative Adversarial Networks (GANs), often encounter challenges during training, including mode collapse. To address these issues, we proposed the AE-COT-GAN model (Autoencoder-based Conditional Optimal Transport Generative Adversarial Network) for the generation of medical images belonging to specific categories. The training process of our model comprises three fundamental components. The training process of our model encompasses three fundamental components. First, we employ an autoencoder model to obtain a low-dimensional manifold representation of real images. Second, we apply extended semi-discrete optimal transport to map Gaussian noise distribution to the latent space distribution and obtain corresponding labels effectively. This procedure leads to the generation of new latent codes with known labels. Finally, we integrate a GAN to train the decoder further to generate medical images. To evaluate the performance of the AE-COT-GAN model, we conducted experiments on two medical image datasets, namely DermaMNIST and BloodMNIST. The model’s performance was compared with state-of-the-art generative models. Results show that the AE-COT-GAN model had excellent performance in generating medical images. Moreover, it effectively addressed the common issues associated with traditional GANs.http://www.sciencedirect.com/science/article/pii/S2468502X23000529Medical image generationMode collapseMode mixingOptimal transportGenerative adversarial networks
spellingShingle Jun Wang
Bohan Lei
Liya Ding
Xiaoyin Xu
Xianfeng Gu
Min Zhang
Autoencoder-based conditional optimal transport generative adversarial network for medical image generation
Visual Informatics
Medical image generation
Mode collapse
Mode mixing
Optimal transport
Generative adversarial networks
title Autoencoder-based conditional optimal transport generative adversarial network for medical image generation
title_full Autoencoder-based conditional optimal transport generative adversarial network for medical image generation
title_fullStr Autoencoder-based conditional optimal transport generative adversarial network for medical image generation
title_full_unstemmed Autoencoder-based conditional optimal transport generative adversarial network for medical image generation
title_short Autoencoder-based conditional optimal transport generative adversarial network for medical image generation
title_sort autoencoder based conditional optimal transport generative adversarial network for medical image generation
topic Medical image generation
Mode collapse
Mode mixing
Optimal transport
Generative adversarial networks
url http://www.sciencedirect.com/science/article/pii/S2468502X23000529
work_keys_str_mv AT junwang autoencoderbasedconditionaloptimaltransportgenerativeadversarialnetworkformedicalimagegeneration
AT bohanlei autoencoderbasedconditionaloptimaltransportgenerativeadversarialnetworkformedicalimagegeneration
AT liyading autoencoderbasedconditionaloptimaltransportgenerativeadversarialnetworkformedicalimagegeneration
AT xiaoyinxu autoencoderbasedconditionaloptimaltransportgenerativeadversarialnetworkformedicalimagegeneration
AT xianfenggu autoencoderbasedconditionaloptimaltransportgenerativeadversarialnetworkformedicalimagegeneration
AT minzhang autoencoderbasedconditionaloptimaltransportgenerativeadversarialnetworkformedicalimagegeneration