Summary: | Abstract Absolute risks of stroke are typically estimated using measurements of cardiovascular disease risk factors recorded at a single visit. However, the comparative utility of single versus sequential risk factor measurements for stroke prediction is unclear. Risk factors were recorded on three separate visits on 13,753 individuals in the prospective China Kadoorie Biobank. All participants were stroke-free at baseline (2004–2008), first resurvey (2008), and second resurvey (2013–2014), and were followed-up for incident cases of first stroke in the 3 years following the second resurvey. To reflect the models currently used in clinical practice, sex-specific Cox models were developed to estimate 3-year risks of stroke using single measurements recorded at second resurvey and were retrospectively applied to risk factor data from previous visits. Temporal trends in the Cox-generated risk estimates from 2004 to 2014 were analyzed using linear mixed effects models. To assess the value of more flexible machine learning approaches and the incorporation of longitudinal data, we developed gradient boosted tree (GBT) models for 3-year prediction of stroke using both single measurements and sequential measurements of risk factor inputs. Overall, Cox-generated estimates for 3-year stroke risk increased by 0.3% per annum in men and 0.2% per annum in women, but varied substantially between individuals. The risk estimates at second resurvey were highly correlated with the annual increase of risk for each individual (men: r = 0.91, women: r = 0.89), and performance of the longitudinal GBT models was comparable with both Cox and GBT models that considered measurements from only a single visit (AUCs: 0.779–0.811 in men, 0.724–0.756 in women). These results provide support for current clinical guidelines, which recommend using risk factor measurements recorded at a single visit for stroke prediction.
|