Prediction Models for Glaucoma in a Multicenter Electronic Health Records Consortium: The Sight Outcomes Research Collaborative

Purpose: Advances in artificial intelligence have enabled the development of predictive models for glaucoma. However, most work is single-center and uncertainty exists regarding the generalizability of such models. The purpose of this study was to build and evaluate machine learning (ML) approaches...

Full description

Bibliographic Details
Main Authors: Sophia Y. Wang, MD, MS, Rohith Ravindranath, MS, Joshua D. Stein, MD, MS, Sejal Amin, Paul A. Edwards, Divya Srikumaran, Fasika Woreta, Jeffrey S. Schultz, Anurag Shrivastava, Baseer Ahmad, Judy Kim, Paul Bryar, Dustin French, Brian L. Vanderbeek, Suzann Pershing, Sophia Y. Wang, Anne M. Lynch, Jenna Patnaik, Saleha Munir, Wuqaas Munir, Joshua Stein, Lindsey DeLott, Brian C. Stagg, Barbara Wirostko, Brian McMillian, Arsham Sheybani
Format: Article
Language:English
Published: Elsevier 2024-05-01
Series:Ophthalmology Science
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S266691452300177X
_version_ 1797345852367306752
author Sophia Y. Wang, MD, MS
Rohith Ravindranath, MS
Joshua D. Stein, MD, MS
Sejal Amin
Paul A. Edwards
Divya Srikumaran
Fasika Woreta
Jeffrey S. Schultz
Anurag Shrivastava
Baseer Ahmad
Judy Kim
Paul Bryar
Dustin French
Brian L. Vanderbeek
Suzann Pershing
Sophia Y. Wang
Anne M. Lynch
Jenna Patnaik
Saleha Munir
Wuqaas Munir
Joshua Stein
Lindsey DeLott
Brian C. Stagg
Barbara Wirostko
Brian McMillian
Arsham Sheybani
author_facet Sophia Y. Wang, MD, MS
Rohith Ravindranath, MS
Joshua D. Stein, MD, MS
Sejal Amin
Paul A. Edwards
Divya Srikumaran
Fasika Woreta
Jeffrey S. Schultz
Anurag Shrivastava
Baseer Ahmad
Judy Kim
Paul Bryar
Dustin French
Brian L. Vanderbeek
Suzann Pershing
Sophia Y. Wang
Anne M. Lynch
Jenna Patnaik
Saleha Munir
Wuqaas Munir
Joshua Stein
Lindsey DeLott
Brian C. Stagg
Barbara Wirostko
Brian McMillian
Arsham Sheybani
author_sort Sophia Y. Wang, MD, MS
collection DOAJ
description Purpose: Advances in artificial intelligence have enabled the development of predictive models for glaucoma. However, most work is single-center and uncertainty exists regarding the generalizability of such models. The purpose of this study was to build and evaluate machine learning (ML) approaches to predict glaucoma progression requiring surgery using data from a large multicenter consortium of electronic health records (EHR). Design: Cohort study. Participants: Thirty-six thousand five hundred forty-eight patients with glaucoma, as identified by International Classification of Diseases (ICD) codes from 6 academic eye centers participating in the Sight OUtcomes Research Collaborative (SOURCE). Methods: We developed ML models to predict whether patients with glaucoma would progress to glaucoma surgery in the coming year (identified by Current Procedural Terminology codes) using the following modeling approaches: (1) penalized logistic regression (lasso, ridge, and elastic net); (2) tree-based models (random forest, gradient boosted machines, and XGBoost), and (3) deep learning models. Model input features included demographics, diagnosis codes, medications, and clinical information (intraocular pressure, visual acuity, refractive status, and central corneal thickness) available from structured EHR data. One site was reserved as an “external site” test set (N = 1550); of the patients from the remaining sites, 10% each were randomly selected to be in development and test sets, with the remaining 27 999 reserved for model training. Main Outcome Measures: Evaluation metrics included area under the receiver operating characteristic curve (AUROC) on the test set and the external site. Results: Six thousand nineteen (16.5%) of 36 548 patients underwent glaucoma surgery. Overall, the AUROC ranged from 0.735 to 0.771 on the random test set and from 0.706 to 0.754 on the external test site, with the XGBoost and random forest model performing best, respectively. There was greatest performance decrease from the random test set to the external test site for the penalized regression models. Conclusions: Machine learning models developed using structured EHR data can reasonably predict whether glaucoma patients will need surgery, with reasonable generalizability to an external site. Additional research is needed to investigate the impact of protected class characteristics such as race or gender on model performance and fairness. Financial Disclosure(s): Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.
first_indexed 2024-03-08T11:24:43Z
format Article
id doaj.art-6abe49059f6a402aac3a3c51041e5793
institution Directory Open Access Journal
issn 2666-9145
language English
last_indexed 2024-03-08T11:24:43Z
publishDate 2024-05-01
publisher Elsevier
record_format Article
series Ophthalmology Science
spelling doaj.art-6abe49059f6a402aac3a3c51041e57932024-01-26T05:35:33ZengElsevierOphthalmology Science2666-91452024-05-0143100445Prediction Models for Glaucoma in a Multicenter Electronic Health Records Consortium: The Sight Outcomes Research CollaborativeSophia Y. Wang, MD, MS0Rohith Ravindranath, MS1Joshua D. Stein, MD, MS2Sejal AminPaul A. EdwardsDivya SrikumaranFasika WoretaJeffrey S. SchultzAnurag ShrivastavaBaseer AhmadJudy KimPaul BryarDustin FrenchBrian L. VanderbeekSuzann PershingSophia Y. WangAnne M. LynchJenna PatnaikSaleha MunirWuqaas MunirJoshua SteinLindsey DeLottBrian C. StaggBarbara WirostkoBrian McMillianArsham SheybaniDepartment of Ophthalmology, Byers Eye Institute, Stanford University, Palo Alto, California; Correspondence: Sophia Y. Wang, MD, MS, Department of Ophthalmology, Byers Eye Institute, Stanford University, 2370 Watson Ct, Palo Alto, CA 94303.Department of Ophthalmology, Byers Eye Institute, Stanford University, Palo Alto, CaliforniaDepartment of Ophthalmology & Visual Sciences, University of Michigan Kellogg Eye Center, Ann Arbor, MichiganPurpose: Advances in artificial intelligence have enabled the development of predictive models for glaucoma. However, most work is single-center and uncertainty exists regarding the generalizability of such models. The purpose of this study was to build and evaluate machine learning (ML) approaches to predict glaucoma progression requiring surgery using data from a large multicenter consortium of electronic health records (EHR). Design: Cohort study. Participants: Thirty-six thousand five hundred forty-eight patients with glaucoma, as identified by International Classification of Diseases (ICD) codes from 6 academic eye centers participating in the Sight OUtcomes Research Collaborative (SOURCE). Methods: We developed ML models to predict whether patients with glaucoma would progress to glaucoma surgery in the coming year (identified by Current Procedural Terminology codes) using the following modeling approaches: (1) penalized logistic regression (lasso, ridge, and elastic net); (2) tree-based models (random forest, gradient boosted machines, and XGBoost), and (3) deep learning models. Model input features included demographics, diagnosis codes, medications, and clinical information (intraocular pressure, visual acuity, refractive status, and central corneal thickness) available from structured EHR data. One site was reserved as an “external site” test set (N = 1550); of the patients from the remaining sites, 10% each were randomly selected to be in development and test sets, with the remaining 27 999 reserved for model training. Main Outcome Measures: Evaluation metrics included area under the receiver operating characteristic curve (AUROC) on the test set and the external site. Results: Six thousand nineteen (16.5%) of 36 548 patients underwent glaucoma surgery. Overall, the AUROC ranged from 0.735 to 0.771 on the random test set and from 0.706 to 0.754 on the external test site, with the XGBoost and random forest model performing best, respectively. There was greatest performance decrease from the random test set to the external test site for the penalized regression models. Conclusions: Machine learning models developed using structured EHR data can reasonably predict whether glaucoma patients will need surgery, with reasonable generalizability to an external site. Additional research is needed to investigate the impact of protected class characteristics such as race or gender on model performance and fairness. Financial Disclosure(s): Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.http://www.sciencedirect.com/science/article/pii/S266691452300177XMachine learningGlaucomaMulticenter studyDeep learning
spellingShingle Sophia Y. Wang, MD, MS
Rohith Ravindranath, MS
Joshua D. Stein, MD, MS
Sejal Amin
Paul A. Edwards
Divya Srikumaran
Fasika Woreta
Jeffrey S. Schultz
Anurag Shrivastava
Baseer Ahmad
Judy Kim
Paul Bryar
Dustin French
Brian L. Vanderbeek
Suzann Pershing
Sophia Y. Wang
Anne M. Lynch
Jenna Patnaik
Saleha Munir
Wuqaas Munir
Joshua Stein
Lindsey DeLott
Brian C. Stagg
Barbara Wirostko
Brian McMillian
Arsham Sheybani
Prediction Models for Glaucoma in a Multicenter Electronic Health Records Consortium: The Sight Outcomes Research Collaborative
Ophthalmology Science
Machine learning
Glaucoma
Multicenter study
Deep learning
title Prediction Models for Glaucoma in a Multicenter Electronic Health Records Consortium: The Sight Outcomes Research Collaborative
title_full Prediction Models for Glaucoma in a Multicenter Electronic Health Records Consortium: The Sight Outcomes Research Collaborative
title_fullStr Prediction Models for Glaucoma in a Multicenter Electronic Health Records Consortium: The Sight Outcomes Research Collaborative
title_full_unstemmed Prediction Models for Glaucoma in a Multicenter Electronic Health Records Consortium: The Sight Outcomes Research Collaborative
title_short Prediction Models for Glaucoma in a Multicenter Electronic Health Records Consortium: The Sight Outcomes Research Collaborative
title_sort prediction models for glaucoma in a multicenter electronic health records consortium the sight outcomes research collaborative
topic Machine learning
Glaucoma
Multicenter study
Deep learning
url http://www.sciencedirect.com/science/article/pii/S266691452300177X
work_keys_str_mv AT sophiaywangmdms predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT rohithravindranathms predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT joshuadsteinmdms predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT sejalamin predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT paulaedwards predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT divyasrikumaran predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT fasikaworeta predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT jeffreysschultz predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT anuragshrivastava predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT baseerahmad predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT judykim predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT paulbryar predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT dustinfrench predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT brianlvanderbeek predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT suzannpershing predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT sophiaywang predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT annemlynch predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT jennapatnaik predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT salehamunir predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT wuqaasmunir predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT joshuastein predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT lindseydelott predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT briancstagg predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT barbarawirostko predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT brianmcmillian predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative
AT arshamsheybani predictionmodelsforglaucomainamulticenterelectronichealthrecordsconsortiumthesightoutcomesresearchcollaborative