Questionnaire and LGBM Model for Assessing Health Literacy levels of Mongolians in China

Abstract. Background: It is difficult to accurately assess the health literacy (HL) level of Mongolians using the conventional Chinese HL questionnaire, owing to their distinct language, culture and living environment. It is therefore important to design an HL questionnaire specifically for the...

Bibliographic Details
Main Authors: Yan Hong, Xiaoda Zhang
Format: Article
Language: English
Published: BMC 2022-11-01
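Series: BMC Public Health
Subjects: Health literacy; Assessment model; LGBM regression model; Questionnaire design; Quantitative analysis
Online Access: https://doi.org/10.1186/s12889-022-14392-2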
author Yan Hong
Xiaoda Zhang
collection DOAJ
description Abstract. Background: It is difficult to accurately assess the health literacy (HL) level of Mongolians using the conventional Chinese HL questionnaire, owing to their distinct language, culture and living environment. It is therefore important to design an HL questionnaire specifically for them. In addition, existing statistical models cannot assess HL with high precision, so a new HL assessment model is needed. Methods: An HL questionnaire with 68 questions was designed by combining the HLS-EU-Q47 with the characteristics of Mongolians in China. A total of 742 Mongolians aged 18 to 87 in Inner Mongolia, China, answered the questionnaire. A data set of 742 samples was constructed, where each sample has 68 features and 1 target. Based on this data set, XGB and LGBM regression models were constructed to assess the respondents' HL levels, and their performance was compared. The impact of each question on the HL level was quantified using the feature-importance function of the LGBM model, both to verify the effectiveness of the questionnaire and to identify the key factors affecting HL. Results: The HL questionnaire has high reliability, as reflected by its high internal consistency (Cronbach's coefficient = 0.807) and test-retest reliability (Mutual Information Score = 0.803). The validity of the questionnaire was assessed with the KMO measure and the chi-square value of Bartlett's test of sphericity, which are 0.765 and 2486 (p < 0.001), respectively. The R² index and the absolute error of the LGBM-based HL assessment model are 0.98347 and 11, respectively, both better than those of the XGB-based model. The quantitative analysis shows that all 68 questions influence the HL level, but to different degrees. The three most influential factors are age, salary level, and the ability to judge HL information in the media. The HL level distribution of the respondents was 66.71% excellent, 25.74% good and 7.54% poor. Conclusions: The presented 68-question HL questionnaire, together with the LGBM regression model, yields high-precision HL level assessments for Mongolians in China. The impact of each question on the final assessment can be quantified using the feature-importance function of the LGBM model, which improves on existing qualitative analysis methods.
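The reliability and fit statistics named in the abstract can be illustrated with a minimal sketch. This is not the authors' code or data: the function names (`cronbach_alpha`, `r_squared`, `mean_absolute_error`) and the toy values are hypothetical, and the abstract does not say which form of "absolute error" was reported, so the mean absolute error here is an assumption.

```python
from statistics import mean, variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(responses[0])
    if k < 2 or len(responses) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    # Variance of each item (column) across respondents.
    item_columns = list(zip(*responses))
    sum_item_var = sum(variance(col) for col in item_columns)
    # Variance of each respondent's total score.
    total_var = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum_item_var / total_var)


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_bar = mean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - y_bar) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between true and predicted scores."""
    return mean([abs(t - p) for t, p in zip(y_true, y_pred)])


# Hypothetical toy data: 4 respondents x 3 questions, plus made-up HL scores
# and made-up model predictions (assumptions for illustration only).
toy_answers = [[3, 4, 3], [2, 2, 3], [4, 5, 4], [1, 2, 1]]
toy_true = [60.0, 45.0, 80.0, 30.0]
toy_pred = [58.0, 47.0, 79.0, 33.0]

print("alpha:", cronbach_alpha(toy_answers))
print("R^2:", r_squared(toy_true, toy_pred))
print("MAE:", mean_absolute_error(toy_true, toy_pred))
```

In a setting like the one described, the predictions would come from the fitted regression model rather than a hand-written list; the sketch only shows how the reported quantities are defined.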
first_indexed 2024-04-12T08:33:55Z
format Article
id doaj.art-5dcb8bc69be24c2087c26dbd01d6af3b
institution Directory Open Access Journal
issn 1471-2458
language English
last_indexed 2024-04-12T08:33:55Z
publishDate 2022-11-01
publisher BMC
record_format Article
series BMC Public Health
spelling doaj.art-5dcb8bc69be24c2087c26dbd01d6af3b; 2022-12-22T03:40:03Z; eng; BMC; BMC Public Health; 1471-2458; 2022-11-01; vol. 22, no. 1, pp. 1-11; 10.1186/s12889-022-14392-2; Yan Hong (School of Nursing, Inner Mongolia Minzu University); Xiaoda Zhang (Micron Intelligent Manufacturing Systems Science and Technology (Beijing) Co., Ltd)
title Questionnaire and LGBM Model for Assessing Health Literacy levels of Mongolians in China
topic Health literacy
Assessment model
LGBM regression model
Questionnaire design
Quantitative analysis
url https://doi.org/10.1186/s12889-022-14392-2