Mdwgan-gp: data augmentation for gene expression data based on multiple discriminator WGAN-GP

Abstract Background Although gene expression data play significant roles in biological and medical studies, their applications are hampered due to the difficulty and high expenses of gathering them through biological experiments. It is an urgent problem to generate high quality gene expression data...

Полное описание

Библиографические подробности
Главные авторы: Rongyuan Li, Jingli Wu, Gaoshi Li, Jiafei Liu, Junbo Xuan, Qi Zhu
Формат: Статья
Язык:English
Опубликовано: BMC 2023-11-01
Серии:BMC Bioinformatics
Предметы:
Online-ссылка:https://doi.org/10.1186/s12859-023-05558-9
Описание
Итог:Abstract Background Although gene expression data play significant roles in biological and medical studies, their applications are hampered due to the difficulty and high expenses of gathering them through biological experiments. It is an urgent problem to generate high quality gene expression data with computational methods. WGAN-GP, a generative adversarial network-based method, has been successfully applied in augmenting gene expression data. However, mode collapse or over-fitting may take place for small training samples due to just one discriminator is adopted in the method. Results In this study, an improved data augmentation approach MDWGAN-GP, a generative adversarial network model with multiple discriminators, is proposed. In addition, a novel method is devised for enriching training samples based on linear graph convolutional network. Extensive experiments were implemented on real biological data. Conclusions The experimental results have demonstrated that compared with other state-of-the-art methods, the MDWGAN-GP method can produce higher quality generated gene expression data in most cases.
ISSN:1471-2105