PlethAugment: GAN-based PPG augmentation for medical diagnosis in low-resource settings

The paucity of physiological time-series data collected from low-resource clinical settings limits the capabilities of modern machine learning algorithms in achieving high performance. Such performance is further hindered by class imbalance; datasets where a diagnosis is much more common than others...

Full description

Bibliographic Details
Main Authors: Kiyasseh, D, Tadesse, GA, Nhan, LNT, Tan, LV, Thwaites, L, Zhu, T, Clifton, D
Format: Journal article
Language:English
Published: Institute of Electrical and Electronics Engineers 2020
_version_ 1797063893649981440
author Kiyasseh, D
Tadesse, GA
Nhan, LNT
Tan, LV
Thwaites, L
Zhu, T
Clifton, D
author_facet Kiyasseh, D
Tadesse, GA
Nhan, LNT
Tan, LV
Thwaites, L
Zhu, T
Clifton, D
author_sort Kiyasseh, D
collection OXFORD
description The paucity of physiological time-series data collected from low-resource clinical settings limits the capabilities of modern machine learning algorithms in achieving high performance. Such performance is further hindered by class imbalance; datasets where a diagnosis is much more common than others. To overcome these two issues at low-cost while preserving privacy, data augmentation methods can be employed. In the time domain, the traditional method of time-warping could alter the underlying data distribution with detrimental consequences. This is prominent when dealing with physiological conditions that influence the frequency components of data. In this paper, we propose PlethAugment; three different conditional generative adversarial networks (CGANs) with an adapted diversity term for the generation of pathological photoplethysmogram (PPG) signals in order to boost medical classification performance. To evaluate and compare the GANs, we introduce a novel metric-agnostic method; the synthetic generalization curve. We validate this approach on two proprietary and two public datasets representing a diverse set of medical conditions. Compared to training on non-augmented class-balanced datasets, training on augmented datasets leads to an improvement of the AUROC by up to 29% when using cross validation. This illustrates the potential of the proposed CGANs to significantly improve classification performance.
first_indexed 2024-03-06T21:06:28Z
format Journal article
id oxford-uuid:3ca3b2e5-f6e9-4975-9596-3e31bea8f9ea
institution University of Oxford
language English
last_indexed 2024-03-06T21:06:28Z
publishDate 2020
publisher Institute of Electrical and Electronics Engineers
record_format dspace
spelling oxford-uuid:3ca3b2e5-f6e9-4975-9596-3e31bea8f9ea2022-03-26T14:14:55ZPlethAugment: GAN-based PPG augmentation for medical diagnosis in low-resource settingsJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:3ca3b2e5-f6e9-4975-9596-3e31bea8f9eaEnglishSymplectic ElementsInstitute of Electrical and Electronics Engineers2020Kiyasseh, DTadesse, GANhan, LNTTan, LVThwaites, LZhu, TClifton, DThe paucity of physiological time-series data collected from low-resource clinical settings limits the capabilities of modern machine learning algorithms in achieving high performance. Such performance is further hindered by class imbalance; datasets where a diagnosis is much more common than others. To overcome these two issues at low-cost while preserving privacy, data augmentation methods can be employed. In the time domain, the traditional method of time-warping could alter the underlying data distribution with detrimental consequences. This is prominent when dealing with physiological conditions that influence the frequency components of data. In this paper, we propose PlethAugment; three different conditional generative adversarial networks (CGANs) with an adapted diversity term for the generation of pathological photoplethysmogram (PPG) signals in order to boost medical classification performance. To evaluate and compare the GANs, we introduce a novel metric-agnostic method; the synthetic generalization curve. We validate this approach on two proprietary and two public datasets representing a diverse set of medical conditions. Compared to training on non-augmented class-balanced datasets, training on augmented datasets leads to an improvement of the AUROC by up to 29% when using cross validation. This illustrates the potential of the proposed CGANs to significantly improve classification performance.
spellingShingle Kiyasseh, D
Tadesse, GA
Nhan, LNT
Tan, LV
Thwaites, L
Zhu, T
Clifton, D
PlethAugment: GAN-based PPG augmentation for medical diagnosis in low-resource settings
title PlethAugment: GAN-based PPG augmentation for medical diagnosis in low-resource settings
title_full PlethAugment: GAN-based PPG augmentation for medical diagnosis in low-resource settings
title_fullStr PlethAugment: GAN-based PPG augmentation for medical diagnosis in low-resource settings
title_full_unstemmed PlethAugment: GAN-based PPG augmentation for medical diagnosis in low-resource settings
title_short PlethAugment: GAN-based PPG augmentation for medical diagnosis in low-resource settings
title_sort plethaugment gan based ppg augmentation for medical diagnosis in low resource settings
work_keys_str_mv AT kiyassehd plethaugmentganbasedppgaugmentationformedicaldiagnosisinlowresourcesettings
AT tadessega plethaugmentganbasedppgaugmentationformedicaldiagnosisinlowresourcesettings
AT nhanlnt plethaugmentganbasedppgaugmentationformedicaldiagnosisinlowresourcesettings
AT tanlv plethaugmentganbasedppgaugmentationformedicaldiagnosisinlowresourcesettings
AT thwaitesl plethaugmentganbasedppgaugmentationformedicaldiagnosisinlowresourcesettings
AT zhut plethaugmentganbasedppgaugmentationformedicaldiagnosisinlowresourcesettings
AT cliftond plethaugmentganbasedppgaugmentationformedicaldiagnosisinlowresourcesettings