Estimation of the Prevalence of Nonalcoholic Fatty Liver Disease in an Adult Population in Northern China Using the Data Mining Approach

TengFei Yang,1 Bo Zhao,2 Dongmei Pei1
1Department of Health Management, Shengjing Hospital of China Medical University, Shenyang, People’s Republic of China; 2Department of Pulmonary and Critical Care Medicine, Shengjing Hospital of China Medical University, Shenyang, People’s Republic of China
Corre...

Bibliographic Details
Main Authors: Yang T, Zhao B, Pei D
Format: Article
Language: English
Published: Dove Medical Press 2021-07-01
Series: Diabetes, Metabolic Syndrome and Obesity
Subjects: nonalcoholic fatty liver disease; J48 algorithm; decision tree; risk factors
Online Access: https://www.dovepress.com/estimation-of-the-prevalence-of-nonalcoholic-fatty-liver-disease-in-an-peer-reviewed-fulltext-article-DMSO
author Yang T
Zhao B
Pei D
collection DOAJ
description TengFei Yang,1 Bo Zhao,2 Dongmei Pei1
1Department of Health Management, Shengjing Hospital of China Medical University, Shenyang, People’s Republic of China; 2Department of Pulmonary and Critical Care Medicine, Shengjing Hospital of China Medical University, Shenyang, People’s Republic of China
Correspondence: Dongmei Pei, Department of Health Management, Shengjing Hospital of China Medical University, No. 36, Sanhao Street, Heping District, Shenyang, 110004, People’s Republic of China. Email: peidm1111@hotmail.com
Background: Nonalcoholic fatty liver disease (NAFLD) is the commonest form of chronic liver disease worldwide and its prevalence is rapidly increasing. Screening and early diagnosis of high-risk groups are important for the prevention and treatment of NAFLD; however, traditional imaging examinations are expensive and difficult to perform on a large scale. This study aimed to develop a simple and reliable predictive model based on the risk factors for NAFLD using a decision tree algorithm for the diagnosis of NAFLD and reduction of healthcare costs.
Methods: This retrospective cross-sectional study included 22,819 participants who underwent annual health examinations between January 2019 and December 2019 at Physical Examination Center in Shengjing Hospital of China Medical University. After rigorous data screening, data of 9190 participants were retained in the final dataset for use in the J48 decision tree algorithm for the construction of predictive models. Approximately 66% of these patients (n=6065) were randomly assigned to the training dataset for the construction of the decision tree, while 34% of the patients (n=3125) were assigned to the test dataset to evaluate the performance of the decision tree.
Results: The results showed that the J48 decision tree classifier exhibited good performance (accuracy=0.830, precision=0.837, recall=0.830, F-measure=0.830, and area under the curve=0.905). The decision tree structure revealed waist circumference as the most significant attribute, followed by triglyceride levels, systolic blood pressure, sex, age, and total cholesterol level.
Conclusion: Our study suggests that a decision tree analysis can be used to screen high-risk individuals for NAFLD. The key attributes in the tree structure can further contribute to the prevention of NAFLD by suggesting implementable targeted community interventions, which can help improve the outcome of NAFLD and reduce the burden on the healthcare system.
Keywords: nonalcoholic fatty liver disease, J48 algorithm, decision tree, risk factors
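The percentage split and the evaluation metrics named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the abstract names the J48 algorithm but gives no implementation details, so the function names `percentage_split` and `weighted_metrics`, the random seed, and the confusion matrix below are hypothetical, and the class-weighted averaging of precision, recall, and F-measure is an assumption about how the figures were summarized. Only the record count (9190) and the 66% training share come from the abstract.

```python
import random


def percentage_split(n_records, train_percent, seed=0):
    """Shuffle record indices and cut them into training and test index lists."""
    indices = list(range(n_records))
    random.Random(seed).shuffle(indices)
    n_train = round(n_records * train_percent / 100)
    return indices[:n_train], indices[n_train:]


def weighted_metrics(confusion):
    """Return accuracy and class-weighted precision, recall, and F-measure.

    confusion[i][j] is the number of records of true class i predicted as class j.
    Each per-class score is weighted by that class's share of all records.
    """
    n_classes = len(confusion)
    total = sum(sum(row) for row in confusion)
    accuracy = sum(confusion[c][c] for c in range(n_classes)) / total
    precision = recall = f_measure = 0.0
    for c in range(n_classes):
        true_pos = confusion[c][c]
        actual = sum(confusion[c])
        predicted = sum(confusion[r][c] for r in range(n_classes))
        p = true_pos / predicted if predicted else 0.0
        r = true_pos / actual if actual else 0.0
        f = 2 * p * r / (p + r) if p + r else 0.0
        weight = actual / total
        precision += weight * p
        recall += weight * r
        f_measure += weight * f
    return accuracy, precision, recall, f_measure


# Record count and training share as stated in the abstract.
train, test = percentage_split(9190, 66)
print(len(train), len(test))  # 6065 3125

# Hypothetical confusion matrix for illustration only (not the study's data).
demo = [[80, 20], [10, 90]]
acc, prec, rec, f1 = weighted_metrics(demo)
print(f"accuracy={acc:.3f} precision={prec:.3f} recall={rec:.3f} F-measure={f1:.3f}")
```

Under class-weighted averaging, recall is always equal to accuracy, which would be consistent with both being reported as 0.830; the abstract itself does not state which averaging method was used.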
first_indexed 2024-04-10T18:10:17Z
format Article
id doaj.art-50224dc183b9442cb763ec37370a453e
institution Directory Open Access Journal
issn 1178-7007
language English
last_indexed 2024-04-10T18:10:17Z
publishDate 2021-07-01
publisher Dove Medical Press
record_format Article
series Diabetes, Metabolic Syndrome and Obesity
spelling doaj.art-50224dc183b9442cb763ec37370a453e 2023-02-02T11:07:02Z eng Dove Medical Press Diabetes, Metabolic Syndrome and Obesity 1178-7007 2021-07-01 Volume 14 3437 3445 67368
title Estimation of the Prevalence of Nonalcoholic Fatty Liver Disease in an Adult Population in Northern China Using the Data Mining Approach
topic nonalcoholic fatty liver disease
j48 algorithm
decision tree
risk factors
url https://www.dovepress.com/estimation-of-the-prevalence-of-nonalcoholic-fatty-liver-disease-in-an-peer-reviewed-fulltext-article-DMSO