Development and validation of ischemic heart disease and stroke prognostic models using large-scale real-world data from Japan

Background: Previous cardiovascular risk prediction models in Japan have utilized prospective cohort studies with concise data. As the health information including health check-up records and administrative claims becomes digitalized and publicly available, application of large datasets based on suc...

Full description

Bibliographic Details
Main Authors: Shigeto Yoshida, Shu Tanaka, Masafumi Okada, Takuya Ohki, Kazumasa Yamagishi, Yasushi Okuno
Format: Article
Language:English
Published: Komiyama Printing Co. Ltd 2023-02-01
Series:Environmental Health and Preventive Medicine
Subjects:
Online Access:https://www.jstage.jst.go.jp/article/ehpm/28/0/28_22-00106/_html/-char/en
_version_ 1797893783960944640
author Shigeto Yoshida
Shu Tanaka
Masafumi Okada
Takuya Ohki
Kazumasa Yamagishi
Yasushi Okuno
author_facet Shigeto Yoshida
Shu Tanaka
Masafumi Okada
Takuya Ohki
Kazumasa Yamagishi
Yasushi Okuno
author_sort Shigeto Yoshida
collection DOAJ
description Background: Previous cardiovascular risk prediction models in Japan have utilized prospective cohort studies with concise data. As the health information including health check-up records and administrative claims becomes digitalized and publicly available, application of large datasets based on such real-world data can achieve prediction accuracy and support social implementation of cardiovascular disease risk prediction models in preventive and clinical practice. In this study, classical regression and machine learning methods were explored to develop ischemic heart disease (IHD) and stroke prognostic models using real-world data. Methods: IQVIA Japan Claims Database was searched to include 691,160 individuals (predominantly corporate employees and their families working in secondary and tertiary industries) with at least one annual health check-up record during the identification period (April 2013–December 2018). The primary outcome of the study was the first recorded IHD or stroke event. Predictors were annual health check-up records at the index year-month, comprising demographic characteristics, laboratory tests, and questionnaire features. Four prediction models (Cox, Elnet-Cox, XGBoost, and Ensemble) were assessed in the present study to develop a cardiovascular disease risk prediction model for Japan. Results: The analysis cohort consisted of 572,971 invididuals. All prediction models showed similarly good performance. The Harrell’s C-index was close to 0.9 for all IHD models, and above 0.7 for stroke models. In IHD models, age, sex, high-density lipoprotein, low-density lipoprotein, cholesterol, and systolic blood pressure had higher importance, while in stroke models systolic blood pressure and age had higher importance. Conclusion: Our study analyzed classical regression and machine learning algorithms to develop cardiovascular disease risk prediction models for IHD and stroke in Japan that can be applied to practical use in a large population with predictive accuracy.
first_indexed 2024-04-10T06:58:19Z
format Article
id doaj.art-87353cac504240ecafff622e314973a6
institution Directory Open Access Journal
issn 1342-078X
1347-4715
language English
last_indexed 2024-04-10T06:58:19Z
publishDate 2023-02-01
publisher Komiyama Printing Co. Ltd
record_format Article
series Environmental Health and Preventive Medicine
spelling doaj.art-87353cac504240ecafff622e314973a62023-02-28T04:27:44ZengKomiyama Printing Co. LtdEnvironmental Health and Preventive Medicine1342-078X1347-47152023-02-0128161610.1265/ehpm.22-00106ehpmDevelopment and validation of ischemic heart disease and stroke prognostic models using large-scale real-world data from JapanShigeto Yoshida0https://orcid.org/0000-0001-5056-6473Shu Tanaka1Masafumi Okada2Takuya Ohki3Kazumasa Yamagishi4Yasushi Okuno5Data Science and Advanced Analytics, IQVIA Solutions Japan K.K.Real-World Evidence Solutions, IQVIA Solutions Japan K.K.Real-World Evidence Solutions, IQVIA Solutions Japan K.K.Real-World Evidence Solutions, IQVIA Solutions Japan K.K.Department of Public Health Medicine, Faculty of Medicine, and Health Services Research and Development Center, University of TsukubaMedical Sciences Innovation Hub Program, RIKENBackground: Previous cardiovascular risk prediction models in Japan have utilized prospective cohort studies with concise data. As the health information including health check-up records and administrative claims becomes digitalized and publicly available, application of large datasets based on such real-world data can achieve prediction accuracy and support social implementation of cardiovascular disease risk prediction models in preventive and clinical practice. In this study, classical regression and machine learning methods were explored to develop ischemic heart disease (IHD) and stroke prognostic models using real-world data. Methods: IQVIA Japan Claims Database was searched to include 691,160 individuals (predominantly corporate employees and their families working in secondary and tertiary industries) with at least one annual health check-up record during the identification period (April 2013–December 2018). The primary outcome of the study was the first recorded IHD or stroke event. Predictors were annual health check-up records at the index year-month, comprising demographic characteristics, laboratory tests, and questionnaire features. Four prediction models (Cox, Elnet-Cox, XGBoost, and Ensemble) were assessed in the present study to develop a cardiovascular disease risk prediction model for Japan. Results: The analysis cohort consisted of 572,971 invididuals. All prediction models showed similarly good performance. The Harrell’s C-index was close to 0.9 for all IHD models, and above 0.7 for stroke models. In IHD models, age, sex, high-density lipoprotein, low-density lipoprotein, cholesterol, and systolic blood pressure had higher importance, while in stroke models systolic blood pressure and age had higher importance. Conclusion: Our study analyzed classical regression and machine learning algorithms to develop cardiovascular disease risk prediction models for IHD and stroke in Japan that can be applied to practical use in a large population with predictive accuracy.https://www.jstage.jst.go.jp/article/ehpm/28/0/28_22-00106/_html/-char/enrisk prediction modelmachine learningischemic heart diseasestrokereal-world data
spellingShingle Shigeto Yoshida
Shu Tanaka
Masafumi Okada
Takuya Ohki
Kazumasa Yamagishi
Yasushi Okuno
Development and validation of ischemic heart disease and stroke prognostic models using large-scale real-world data from Japan
Environmental Health and Preventive Medicine
risk prediction model
machine learning
ischemic heart disease
stroke
real-world data
title Development and validation of ischemic heart disease and stroke prognostic models using large-scale real-world data from Japan
title_full Development and validation of ischemic heart disease and stroke prognostic models using large-scale real-world data from Japan
title_fullStr Development and validation of ischemic heart disease and stroke prognostic models using large-scale real-world data from Japan
title_full_unstemmed Development and validation of ischemic heart disease and stroke prognostic models using large-scale real-world data from Japan
title_short Development and validation of ischemic heart disease and stroke prognostic models using large-scale real-world data from Japan
title_sort development and validation of ischemic heart disease and stroke prognostic models using large scale real world data from japan
topic risk prediction model
machine learning
ischemic heart disease
stroke
real-world data
url https://www.jstage.jst.go.jp/article/ehpm/28/0/28_22-00106/_html/-char/en
work_keys_str_mv AT shigetoyoshida developmentandvalidationofischemicheartdiseaseandstrokeprognosticmodelsusinglargescalerealworlddatafromjapan
AT shutanaka developmentandvalidationofischemicheartdiseaseandstrokeprognosticmodelsusinglargescalerealworlddatafromjapan
AT masafumiokada developmentandvalidationofischemicheartdiseaseandstrokeprognosticmodelsusinglargescalerealworlddatafromjapan
AT takuyaohki developmentandvalidationofischemicheartdiseaseandstrokeprognosticmodelsusinglargescalerealworlddatafromjapan
AT kazumasayamagishi developmentandvalidationofischemicheartdiseaseandstrokeprognosticmodelsusinglargescalerealworlddatafromjapan
AT yasushiokuno developmentandvalidationofischemicheartdiseaseandstrokeprognosticmodelsusinglargescalerealworlddatafromjapan