Fairness and generalizability of OCT normative databases: a comparative analysis

Abstract Purpose In supervised Machine Learning algorithms, labels and reports are important in model development. To provide a normality assessment, the OCT has an in-built normative database that provides a color base scale from the measurement database comparison. This article aims to evaluate an...

Full description

Bibliographic Details
Main Authors:	Luis Filipe Nakayama, Lucas Zago Ribeiro, Juliana Angelica Estevão de Oliveira, João Carlos Ramos Gonçalves de Matos, William Greig Mitchell, Fernando Korn Malerbi, Leo Anthony Celi, Caio Vinicius Saito Regatieri
Format:	Article
Language:	English
Published:	BMC 2023-08-01
Series:	International Journal of Retina and Vitreous
Subjects:	Optical coherence tomography Fairness Generalizability Database Supervised machine learning
Online Access:	https://doi.org/10.1186/s40942-023-00459-8

_version_	1827709210177044480
author	Luis Filipe Nakayama Lucas Zago Ribeiro Juliana Angelica Estevão de Oliveira João Carlos Ramos Gonçalves de Matos William Greig Mitchell Fernando Korn Malerbi Leo Anthony Celi Caio Vinicius Saito Regatieri
author_facet	Luis Filipe Nakayama Lucas Zago Ribeiro Juliana Angelica Estevão de Oliveira João Carlos Ramos Gonçalves de Matos William Greig Mitchell Fernando Korn Malerbi Leo Anthony Celi Caio Vinicius Saito Regatieri
author_sort	Luis Filipe Nakayama
collection	DOAJ
description	Abstract Purpose In supervised Machine Learning algorithms, labels and reports are important in model development. To provide a normality assessment, the OCT has an in-built normative database that provides a color base scale from the measurement database comparison. This article aims to evaluate and compare normative databases of different OCT machines, analyzing patient demographic, contrast inclusion and exclusion criteria, diversity index, and statistical approach to assess their fairness and generalizability. Methods Data were retrieved from Cirrus, Avanti, Spectralis, and Triton’s FDA-approval and equipment manual. The following variables were compared: number of eyes and patients, inclusion and exclusion criteria, statistical approach, sex, race and ethnicity, age, participant country, and diversity index. Results Avanti OCT has the largest normative database (640 eyes). In every database, the inclusion and exclusion criteria were similar, including adult patients and excluding pathological eyes. Spectralis has the largest White (79.7%) proportionately representation, Cirrus has the largest Asian (24%), and Triton has the largest Black (22%) patient representation. In all databases, the statistical analysis applied was Regression models. The sex diversity index is similar in all datasets, and comparable to the ten most populous contries. Avanti dataset has the highest diversity index in terms of race, followed by Cirrus, Triton, and Spectralis. Conclusion In all analyzed databases, the data framework is static, with limited upgrade options and lacking normative databases for new modules. As a result, caution in OCT normality interpretation is warranted. To address these limitations, there is a need for more diverse, representative, and open-access datasets that take into account patient demographics, especially considering the development of supervised Machine Learning algorithms in healthcare.
first_indexed	2024-03-10T17:16:44Z
format	Article
id	doaj.art-ea8702d9f4b649b285623e9d9e8f9a41
institution	Directory Open Access Journal
issn	2056-9920
language	English
last_indexed	2024-03-10T17:16:44Z
publishDate	2023-08-01
publisher	BMC
record_format	Article
series	International Journal of Retina and Vitreous
spelling	doaj.art-ea8702d9f4b649b285623e9d9e8f9a412023-11-20T10:28:43ZengBMCInternational Journal of Retina and Vitreous2056-99202023-08-01911710.1186/s40942-023-00459-8Fairness and generalizability of OCT normative databases: a comparative analysisLuis Filipe Nakayama0Lucas Zago Ribeiro1Juliana Angelica Estevão de Oliveira2João Carlos Ramos Gonçalves de Matos3William Greig Mitchell4Fernando Korn Malerbi5Leo Anthony Celi6Caio Vinicius Saito Regatieri7Laboratory of Computational Physiology, Massachusetts Institute of TechnologyDepartment of Ophthalmology, São Paulo Federal UniversityDepartment of Ophthalmology, São Paulo Federal UniversityLaboratory of Computational Physiology, Massachusetts Institute of TechnologyDepartment of Ophthalmology, Royal Victorian Eye and Ear HospitalDepartment of Ophthalmology, São Paulo Federal UniversityLaboratory of Computational Physiology, Massachusetts Institute of TechnologyDepartment of Ophthalmology, São Paulo Federal UniversityAbstract Purpose In supervised Machine Learning algorithms, labels and reports are important in model development. To provide a normality assessment, the OCT has an in-built normative database that provides a color base scale from the measurement database comparison. This article aims to evaluate and compare normative databases of different OCT machines, analyzing patient demographic, contrast inclusion and exclusion criteria, diversity index, and statistical approach to assess their fairness and generalizability. Methods Data were retrieved from Cirrus, Avanti, Spectralis, and Triton’s FDA-approval and equipment manual. The following variables were compared: number of eyes and patients, inclusion and exclusion criteria, statistical approach, sex, race and ethnicity, age, participant country, and diversity index. Results Avanti OCT has the largest normative database (640 eyes). In every database, the inclusion and exclusion criteria were similar, including adult patients and excluding pathological eyes. Spectralis has the largest White (79.7%) proportionately representation, Cirrus has the largest Asian (24%), and Triton has the largest Black (22%) patient representation. In all databases, the statistical analysis applied was Regression models. The sex diversity index is similar in all datasets, and comparable to the ten most populous contries. Avanti dataset has the highest diversity index in terms of race, followed by Cirrus, Triton, and Spectralis. Conclusion In all analyzed databases, the data framework is static, with limited upgrade options and lacking normative databases for new modules. As a result, caution in OCT normality interpretation is warranted. To address these limitations, there is a need for more diverse, representative, and open-access datasets that take into account patient demographics, especially considering the development of supervised Machine Learning algorithms in healthcare.https://doi.org/10.1186/s40942-023-00459-8Optical coherence tomographyFairnessGeneralizabilityDatabaseSupervised machine learning
spellingShingle	Luis Filipe Nakayama Lucas Zago Ribeiro Juliana Angelica Estevão de Oliveira João Carlos Ramos Gonçalves de Matos William Greig Mitchell Fernando Korn Malerbi Leo Anthony Celi Caio Vinicius Saito Regatieri Fairness and generalizability of OCT normative databases: a comparative analysis International Journal of Retina and Vitreous Optical coherence tomography Fairness Generalizability Database Supervised machine learning
title	Fairness and generalizability of OCT normative databases: a comparative analysis
title_full	Fairness and generalizability of OCT normative databases: a comparative analysis
title_fullStr	Fairness and generalizability of OCT normative databases: a comparative analysis
title_full_unstemmed	Fairness and generalizability of OCT normative databases: a comparative analysis
title_short	Fairness and generalizability of OCT normative databases: a comparative analysis
title_sort	fairness and generalizability of oct normative databases a comparative analysis
topic	Optical coherence tomography Fairness Generalizability Database Supervised machine learning
url	https://doi.org/10.1186/s40942-023-00459-8
work_keys_str_mv	AT luisfilipenakayama fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis AT lucaszagoribeiro fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis AT julianaangelicaestevaodeoliveira fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis AT joaocarlosramosgoncalvesdematos fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis AT williamgreigmitchell fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis AT fernandokornmalerbi fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis AT leoanthonyceli fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis AT caioviniciussaitoregatieri fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis

Fairness and generalizability of OCT normative databases: a comparative analysis

Similar Items