Predicting long-term type 2 diabetes with support vector machine using oral glucose tolerance test.

Diabetes is a large healthcare burden worldwide. There is substantial evidence that lifestyle modifications and drug intervention can prevent diabetes, therefore, an early identification of high risk individuals is important to design targeted prevention strategies. In this paper, we present an auto...

Full description

Bibliographic Details
Main Authors: Hasan T Abbas, Lejla Alic, Madhav Erraguntla, Jim X Ji, Muhammad Abdul-Ghani, Qammer H Abbasi, Marwa K Qaraqe
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2019-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0219636
_version_ 1819259529497411584
author Hasan T Abbas
Lejla Alic
Madhav Erraguntla
Jim X Ji
Muhammad Abdul-Ghani
Qammer H Abbasi
Marwa K Qaraqe
author_facet Hasan T Abbas
Lejla Alic
Madhav Erraguntla
Jim X Ji
Muhammad Abdul-Ghani
Qammer H Abbasi
Marwa K Qaraqe
author_sort Hasan T Abbas
collection DOAJ
description Diabetes is a large healthcare burden worldwide. There is substantial evidence that lifestyle modifications and drug intervention can prevent diabetes, therefore, an early identification of high risk individuals is important to design targeted prevention strategies. In this paper, we present an automatic tool that uses machine learning techniques to predict the development of type 2 diabetes mellitus (T2DM). Data generated from an oral glucose tolerance test (OGTT) was used to develop a predictive model based on the support vector machine (SVM). We trained and validated the models using the OGTT and demographic data of 1,492 healthy individuals collected during the San Antonio Heart Study. This study collected plasma glucose and insulin concentrations before glucose intake and at three time-points thereafter (30, 60 and 120 min). Furthermore, personal information such as age, ethnicity and body-mass index was also a part of the data-set. Using 11 OGTT measurements, we have deduced 61 features, which are then assigned a rank and the top ten features are shortlisted using minimum redundancy maximum relevance feature selection algorithm. All possible combinations of the 10 best ranked features were used to generate SVM based prediction models. This research shows that an individual's plasma glucose levels, and the information derived therefrom have the strongest predictive performance for the future development of T2DM. Significantly, insulin and demographic features do not provide additional performance improvement for diabetes prediction. The results of this work identify the parsimonious clinical data needed to be collected for an efficient prediction of T2DM. Our approach shows an average accuracy of 96.80% and a sensitivity of 80.09% obtained on a holdout set.
first_indexed 2024-12-23T19:11:28Z
format Article
id doaj.art-888c3614dcc845c2a60a1f38cc5341ff
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-23T19:11:28Z
publishDate 2019-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-888c3614dcc845c2a60a1f38cc5341ff2022-12-21T17:34:26ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-011412e021963610.1371/journal.pone.0219636Predicting long-term type 2 diabetes with support vector machine using oral glucose tolerance test.Hasan T AbbasLejla AlicMadhav ErraguntlaJim X JiMuhammad Abdul-GhaniQammer H AbbasiMarwa K QaraqeDiabetes is a large healthcare burden worldwide. There is substantial evidence that lifestyle modifications and drug intervention can prevent diabetes, therefore, an early identification of high risk individuals is important to design targeted prevention strategies. In this paper, we present an automatic tool that uses machine learning techniques to predict the development of type 2 diabetes mellitus (T2DM). Data generated from an oral glucose tolerance test (OGTT) was used to develop a predictive model based on the support vector machine (SVM). We trained and validated the models using the OGTT and demographic data of 1,492 healthy individuals collected during the San Antonio Heart Study. This study collected plasma glucose and insulin concentrations before glucose intake and at three time-points thereafter (30, 60 and 120 min). Furthermore, personal information such as age, ethnicity and body-mass index was also a part of the data-set. Using 11 OGTT measurements, we have deduced 61 features, which are then assigned a rank and the top ten features are shortlisted using minimum redundancy maximum relevance feature selection algorithm. All possible combinations of the 10 best ranked features were used to generate SVM based prediction models. This research shows that an individual's plasma glucose levels, and the information derived therefrom have the strongest predictive performance for the future development of T2DM. Significantly, insulin and demographic features do not provide additional performance improvement for diabetes prediction. The results of this work identify the parsimonious clinical data needed to be collected for an efficient prediction of T2DM. Our approach shows an average accuracy of 96.80% and a sensitivity of 80.09% obtained on a holdout set.https://doi.org/10.1371/journal.pone.0219636
spellingShingle Hasan T Abbas
Lejla Alic
Madhav Erraguntla
Jim X Ji
Muhammad Abdul-Ghani
Qammer H Abbasi
Marwa K Qaraqe
Predicting long-term type 2 diabetes with support vector machine using oral glucose tolerance test.
PLoS ONE
title Predicting long-term type 2 diabetes with support vector machine using oral glucose tolerance test.
title_full Predicting long-term type 2 diabetes with support vector machine using oral glucose tolerance test.
title_fullStr Predicting long-term type 2 diabetes with support vector machine using oral glucose tolerance test.
title_full_unstemmed Predicting long-term type 2 diabetes with support vector machine using oral glucose tolerance test.
title_short Predicting long-term type 2 diabetes with support vector machine using oral glucose tolerance test.
title_sort predicting long term type 2 diabetes with support vector machine using oral glucose tolerance test
url https://doi.org/10.1371/journal.pone.0219636
work_keys_str_mv AT hasantabbas predictinglongtermtype2diabeteswithsupportvectormachineusingoralglucosetolerancetest
AT lejlaalic predictinglongtermtype2diabeteswithsupportvectormachineusingoralglucosetolerancetest
AT madhaverraguntla predictinglongtermtype2diabeteswithsupportvectormachineusingoralglucosetolerancetest
AT jimxji predictinglongtermtype2diabeteswithsupportvectormachineusingoralglucosetolerancetest
AT muhammadabdulghani predictinglongtermtype2diabeteswithsupportvectormachineusingoralglucosetolerancetest
AT qammerhabbasi predictinglongtermtype2diabeteswithsupportvectormachineusingoralglucosetolerancetest
AT marwakqaraqe predictinglongtermtype2diabeteswithsupportvectormachineusingoralglucosetolerancetest