Developing risk prediction models for type 2 diabetes: a systematic review of methodology and reporting.

BACKGROUND: The World Health Organisation estimates that by 2030 there will be approximately 350 million people with type 2 diabetes. Associated with renal complications, heart disease, stroke and peripheral vascular disease, early identification of patients with undiagnosed type 2 diabetes or thos...

Full description

Bibliographic Details
Main Authors: Collins, G, Mallett, S, Omar, O, Yu, L
Format: Journal article
Language:English
Published: 2011
_version_ 1826280303251947520
author Collins, G
Mallett, S
Omar, O
Yu, L
author_facet Collins, G
Mallett, S
Omar, O
Yu, L
author_sort Collins, G
collection OXFORD
description BACKGROUND: The World Health Organisation estimates that by 2030 there will be approximately 350 million people with type 2 diabetes. Associated with renal complications, heart disease, stroke and peripheral vascular disease, early identification of patients with undiagnosed type 2 diabetes or those at an increased risk of developing type 2 diabetes is an important challenge. We sought to systematically review and critically assess the conduct and reporting of methods used to develop risk prediction models for predicting the risk of having undiagnosed (prevalent) or future risk of developing (incident) type 2 diabetes in adults. METHODS: We conducted a systematic search of PubMed and EMBASE databases to identify studies published before May 2011 that describe the development of models combining two or more variables to predict the risk of prevalent or incident type 2 diabetes. We extracted key information that describes aspects of developing a prediction model including study design, sample size and number of events, outcome definition, risk predictor selection and coding, missing data, model-building strategies and aspects of performance. RESULTS: Thirty-nine studies comprising 43 risk prediction models were included. Seventeen studies (44%) reported the development of models to predict incident type 2 diabetes, whilst 15 studies (38%) described the derivation of models to predict prevalent type 2 diabetes. In nine studies (23%), the number of events per variable was less than ten, whilst in fourteen studies there was insufficient information reported for this measure to be calculated. The number of candidate risk predictors ranged from four to sixty-four, and in seven studies it was unclear how many risk predictors were considered. A method, not recommended to select risk predictors for inclusion in the multivariate model, using statistical significance from univariate screening was carried out in eight studies (21%), whilst the selection procedure was unclear in ten studies (26%). Twenty-one risk prediction models (49%) were developed by categorising all continuous risk predictors. The treatment and handling of missing data were not reported in 16 studies (41%). CONCLUSIONS: We found widespread use of poor methods that could jeopardise model development, including univariate pre-screening of variables, categorisation of continuous risk predictors and poor handling of missing data. The use of poor methods affects the reliability of the prediction model and ultimately compromises the accuracy of the probability estimates of having undiagnosed type 2 diabetes or the predicted risk of developing type 2 diabetes. In addition, many studies were characterised by a generally poor level of reporting, with many key details to objectively judge the usefulness of the models often omitted.
first_indexed 2024-03-07T00:11:40Z
format Journal article
id oxford-uuid:7972470e-ca1a-4cf8-95d3-5110b53b59a4
institution University of Oxford
language English
last_indexed 2024-03-07T00:11:40Z
publishDate 2011
record_format dspace
spelling oxford-uuid:7972470e-ca1a-4cf8-95d3-5110b53b59a42022-03-26T20:37:28ZDeveloping risk prediction models for type 2 diabetes: a systematic review of methodology and reporting.Journal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:7972470e-ca1a-4cf8-95d3-5110b53b59a4EnglishSymplectic Elements at Oxford2011Collins, GMallett, SOmar, OYu, L BACKGROUND: The World Health Organisation estimates that by 2030 there will be approximately 350 million people with type 2 diabetes. Associated with renal complications, heart disease, stroke and peripheral vascular disease, early identification of patients with undiagnosed type 2 diabetes or those at an increased risk of developing type 2 diabetes is an important challenge. We sought to systematically review and critically assess the conduct and reporting of methods used to develop risk prediction models for predicting the risk of having undiagnosed (prevalent) or future risk of developing (incident) type 2 diabetes in adults. METHODS: We conducted a systematic search of PubMed and EMBASE databases to identify studies published before May 2011 that describe the development of models combining two or more variables to predict the risk of prevalent or incident type 2 diabetes. We extracted key information that describes aspects of developing a prediction model including study design, sample size and number of events, outcome definition, risk predictor selection and coding, missing data, model-building strategies and aspects of performance. RESULTS: Thirty-nine studies comprising 43 risk prediction models were included. Seventeen studies (44%) reported the development of models to predict incident type 2 diabetes, whilst 15 studies (38%) described the derivation of models to predict prevalent type 2 diabetes. In nine studies (23%), the number of events per variable was less than ten, whilst in fourteen studies there was insufficient information reported for this measure to be calculated. The number of candidate risk predictors ranged from four to sixty-four, and in seven studies it was unclear how many risk predictors were considered. A method, not recommended to select risk predictors for inclusion in the multivariate model, using statistical significance from univariate screening was carried out in eight studies (21%), whilst the selection procedure was unclear in ten studies (26%). Twenty-one risk prediction models (49%) were developed by categorising all continuous risk predictors. The treatment and handling of missing data were not reported in 16 studies (41%). CONCLUSIONS: We found widespread use of poor methods that could jeopardise model development, including univariate pre-screening of variables, categorisation of continuous risk predictors and poor handling of missing data. The use of poor methods affects the reliability of the prediction model and ultimately compromises the accuracy of the probability estimates of having undiagnosed type 2 diabetes or the predicted risk of developing type 2 diabetes. In addition, many studies were characterised by a generally poor level of reporting, with many key details to objectively judge the usefulness of the models often omitted.
spellingShingle Collins, G
Mallett, S
Omar, O
Yu, L
Developing risk prediction models for type 2 diabetes: a systematic review of methodology and reporting.
title Developing risk prediction models for type 2 diabetes: a systematic review of methodology and reporting.
title_full Developing risk prediction models for type 2 diabetes: a systematic review of methodology and reporting.
title_fullStr Developing risk prediction models for type 2 diabetes: a systematic review of methodology and reporting.
title_full_unstemmed Developing risk prediction models for type 2 diabetes: a systematic review of methodology and reporting.
title_short Developing risk prediction models for type 2 diabetes: a systematic review of methodology and reporting.
title_sort developing risk prediction models for type 2 diabetes a systematic review of methodology and reporting
work_keys_str_mv AT collinsg developingriskpredictionmodelsfortype2diabetesasystematicreviewofmethodologyandreporting
AT malletts developingriskpredictionmodelsfortype2diabetesasystematicreviewofmethodologyandreporting
AT omaro developingriskpredictionmodelsfortype2diabetesasystematicreviewofmethodologyandreporting
AT yul developingriskpredictionmodelsfortype2diabetesasystematicreviewofmethodologyandreporting