Machine learning models including insulin resistance indexes for predicting liver stiffness in United States population: Data from NHANES

BackgroundPrevention and treatment of liver fibrosis at an early stage is of great prognostic importance, whereas changes in liver stiffness are often overlooked in patients before the onset of obvious clinical symptoms. Recognition of liver fibrosis at an early stage is therefore essential.Objectiv...

Full description

Bibliographic Details
Main Authors: Kexing Han, Kexuan Tan, Jiapei Shen, Yuting Gu, Zilong Wang, Jiayu He, Luyang Kang, Weijie Sun, Long Gao, Yufeng Gao
Format: Article
Language:English
Published: Frontiers Media S.A. 2022-09-01
Series:Frontiers in Public Health
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fpubh.2022.1008794/full
_version_ 1811266075956871168
author Kexing Han
Kexuan Tan
Jiapei Shen
Yuting Gu
Zilong Wang
Jiayu He
Luyang Kang
Weijie Sun
Long Gao
Yufeng Gao
author_facet Kexing Han
Kexuan Tan
Jiapei Shen
Yuting Gu
Zilong Wang
Jiayu He
Luyang Kang
Weijie Sun
Long Gao
Yufeng Gao
author_sort Kexing Han
collection DOAJ
description BackgroundPrevention and treatment of liver fibrosis at an early stage is of great prognostic importance, whereas changes in liver stiffness are often overlooked in patients before the onset of obvious clinical symptoms. Recognition of liver fibrosis at an early stage is therefore essential.ObjectiveAn XGBoost machine learning model was constructed to predict participants' liver stiffness measures (LSM) from general characteristic information, blood test metrics and insulin resistance-related indexes, and to compare the fit efficacy of different datasets for LSM.MethodsAll data were obtained from the National Health and Nutrition Examination Survey (NHANES) for the time interval January 2017 to March 2020. Participants' general characteristics, Liver Ultrasound Transient Elastography (LUTE) information, indicators of blood tests and insulin resistance-related indexes were collected, including homeostasis model assessment of insulin resistance (HOMA-IR) and metabolic score for insulin resistance (METS-IR). Three datasets were generated based on the above information, respectively named dataset A (without the insulin resistance-related indexes as predictor variables), dataset B (with METS-IR as a predictor variable) and dataset C (with HOMA-IR as a predictor variable). XGBoost regression was used in the three datasets to construct machine learning models to predict LSM in participants. A random split was used to divide all participants included in the study into training and validation cohorts in a 3:1 ratio, and models were developed in the training cohort and validated with the validation cohort.ResultsA total of 3,564 participants were included in this study, 2,376 in the training cohort and 1,188 in the validation cohort, and all information was not statistically significantly different between the two cohorts (p > 0.05). In the training cohort, datasets A and B both had better predictive efficacy than dataset C for participants' LSM, with dataset B having the best fitting efficacy [±1.96 standard error (SD), (-1.49,1.48) kPa], which was similarly validated in the validation cohort [±1.96 SD, (-1.56,1.56) kPa].ConclusionsXGBoost machine learning models built from general characteristic information and clinically accessible blood test indicators are practicable for predicting LSM in participants, and a dataset that included METS-IR as a predictor variable would improve the accuracy and stability of the models.
first_indexed 2024-04-12T20:36:38Z
format Article
id doaj.art-f74b17640d9d4efdbc1e0d15a3e14015
institution Directory Open Access Journal
issn 2296-2565
language English
last_indexed 2024-04-12T20:36:38Z
publishDate 2022-09-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Public Health
spelling doaj.art-f74b17640d9d4efdbc1e0d15a3e140152022-12-22T03:17:35ZengFrontiers Media S.A.Frontiers in Public Health2296-25652022-09-011010.3389/fpubh.2022.10087941008794Machine learning models including insulin resistance indexes for predicting liver stiffness in United States population: Data from NHANESKexing HanKexuan TanJiapei ShenYuting GuZilong WangJiayu HeLuyang KangWeijie SunLong GaoYufeng GaoBackgroundPrevention and treatment of liver fibrosis at an early stage is of great prognostic importance, whereas changes in liver stiffness are often overlooked in patients before the onset of obvious clinical symptoms. Recognition of liver fibrosis at an early stage is therefore essential.ObjectiveAn XGBoost machine learning model was constructed to predict participants' liver stiffness measures (LSM) from general characteristic information, blood test metrics and insulin resistance-related indexes, and to compare the fit efficacy of different datasets for LSM.MethodsAll data were obtained from the National Health and Nutrition Examination Survey (NHANES) for the time interval January 2017 to March 2020. Participants' general characteristics, Liver Ultrasound Transient Elastography (LUTE) information, indicators of blood tests and insulin resistance-related indexes were collected, including homeostasis model assessment of insulin resistance (HOMA-IR) and metabolic score for insulin resistance (METS-IR). Three datasets were generated based on the above information, respectively named dataset A (without the insulin resistance-related indexes as predictor variables), dataset B (with METS-IR as a predictor variable) and dataset C (with HOMA-IR as a predictor variable). XGBoost regression was used in the three datasets to construct machine learning models to predict LSM in participants. A random split was used to divide all participants included in the study into training and validation cohorts in a 3:1 ratio, and models were developed in the training cohort and validated with the validation cohort.ResultsA total of 3,564 participants were included in this study, 2,376 in the training cohort and 1,188 in the validation cohort, and all information was not statistically significantly different between the two cohorts (p > 0.05). In the training cohort, datasets A and B both had better predictive efficacy than dataset C for participants' LSM, with dataset B having the best fitting efficacy [±1.96 standard error (SD), (-1.49,1.48) kPa], which was similarly validated in the validation cohort [±1.96 SD, (-1.56,1.56) kPa].ConclusionsXGBoost machine learning models built from general characteristic information and clinically accessible blood test indicators are practicable for predicting LSM in participants, and a dataset that included METS-IR as a predictor variable would improve the accuracy and stability of the models.https://www.frontiersin.org/articles/10.3389/fpubh.2022.1008794/fullliver cirrhosisliver stiffness measurement (LSM)insulin resistanceHOMA-IRMETS-IRmachine learning model
spellingShingle Kexing Han
Kexuan Tan
Jiapei Shen
Yuting Gu
Zilong Wang
Jiayu He
Luyang Kang
Weijie Sun
Long Gao
Yufeng Gao
Machine learning models including insulin resistance indexes for predicting liver stiffness in United States population: Data from NHANES
Frontiers in Public Health
liver cirrhosis
liver stiffness measurement (LSM)
insulin resistance
HOMA-IR
METS-IR
machine learning model
title Machine learning models including insulin resistance indexes for predicting liver stiffness in United States population: Data from NHANES
title_full Machine learning models including insulin resistance indexes for predicting liver stiffness in United States population: Data from NHANES
title_fullStr Machine learning models including insulin resistance indexes for predicting liver stiffness in United States population: Data from NHANES
title_full_unstemmed Machine learning models including insulin resistance indexes for predicting liver stiffness in United States population: Data from NHANES
title_short Machine learning models including insulin resistance indexes for predicting liver stiffness in United States population: Data from NHANES
title_sort machine learning models including insulin resistance indexes for predicting liver stiffness in united states population data from nhanes
topic liver cirrhosis
liver stiffness measurement (LSM)
insulin resistance
HOMA-IR
METS-IR
machine learning model
url https://www.frontiersin.org/articles/10.3389/fpubh.2022.1008794/full
work_keys_str_mv AT kexinghan machinelearningmodelsincludinginsulinresistanceindexesforpredictingliverstiffnessinunitedstatespopulationdatafromnhanes
AT kexuantan machinelearningmodelsincludinginsulinresistanceindexesforpredictingliverstiffnessinunitedstatespopulationdatafromnhanes
AT jiapeishen machinelearningmodelsincludinginsulinresistanceindexesforpredictingliverstiffnessinunitedstatespopulationdatafromnhanes
AT yutinggu machinelearningmodelsincludinginsulinresistanceindexesforpredictingliverstiffnessinunitedstatespopulationdatafromnhanes
AT zilongwang machinelearningmodelsincludinginsulinresistanceindexesforpredictingliverstiffnessinunitedstatespopulationdatafromnhanes
AT jiayuhe machinelearningmodelsincludinginsulinresistanceindexesforpredictingliverstiffnessinunitedstatespopulationdatafromnhanes
AT luyangkang machinelearningmodelsincludinginsulinresistanceindexesforpredictingliverstiffnessinunitedstatespopulationdatafromnhanes
AT weijiesun machinelearningmodelsincludinginsulinresistanceindexesforpredictingliverstiffnessinunitedstatespopulationdatafromnhanes
AT longgao machinelearningmodelsincludinginsulinresistanceindexesforpredictingliverstiffnessinunitedstatespopulationdatafromnhanes
AT yufenggao machinelearningmodelsincludinginsulinresistanceindexesforpredictingliverstiffnessinunitedstatespopulationdatafromnhanes