Fairness and generalizability of OCT normative databases: a comparative analysis

Abstract Purpose In supervised Machine Learning algorithms, labels and reports are important in model development. To provide a normality assessment, the OCT has an in-built normative database that provides a color base scale from the measurement database comparison. This article aims to evaluate an...

Full description

Bibliographic Details
Main Authors: Luis Filipe Nakayama, Lucas Zago Ribeiro, Juliana Angelica Estevão de Oliveira, João Carlos Ramos Gonçalves de Matos, William Greig Mitchell, Fernando Korn Malerbi, Leo Anthony Celi, Caio Vinicius Saito Regatieri
Format: Article
Language:English
Published: BMC 2023-08-01
Series:International Journal of Retina and Vitreous
Subjects:
Online Access:https://doi.org/10.1186/s40942-023-00459-8
_version_ 1827709210177044480
author Luis Filipe Nakayama
Lucas Zago Ribeiro
Juliana Angelica Estevão de Oliveira
João Carlos Ramos Gonçalves de Matos
William Greig Mitchell
Fernando Korn Malerbi
Leo Anthony Celi
Caio Vinicius Saito Regatieri
author_facet Luis Filipe Nakayama
Lucas Zago Ribeiro
Juliana Angelica Estevão de Oliveira
João Carlos Ramos Gonçalves de Matos
William Greig Mitchell
Fernando Korn Malerbi
Leo Anthony Celi
Caio Vinicius Saito Regatieri
author_sort Luis Filipe Nakayama
collection DOAJ
description Abstract Purpose In supervised Machine Learning algorithms, labels and reports are important in model development. To provide a normality assessment, the OCT has an in-built normative database that provides a color base scale from the measurement database comparison. This article aims to evaluate and compare normative databases of different OCT machines, analyzing patient demographic, contrast inclusion and exclusion criteria, diversity index, and statistical approach to assess their fairness and generalizability. Methods Data were retrieved from Cirrus, Avanti, Spectralis, and Triton’s FDA-approval and equipment manual. The following variables were compared: number of eyes and patients, inclusion and exclusion criteria, statistical approach, sex, race and ethnicity, age, participant country, and diversity index. Results Avanti OCT has the largest normative database (640 eyes). In every database, the inclusion and exclusion criteria were similar, including adult patients and excluding pathological eyes. Spectralis has the largest White (79.7%) proportionately representation, Cirrus has the largest Asian (24%), and Triton has the largest Black (22%) patient representation. In all databases, the statistical analysis applied was Regression models. The sex diversity index is similar in all datasets, and comparable to the ten most populous contries. Avanti dataset has the highest diversity index in terms of race, followed by Cirrus, Triton, and Spectralis. Conclusion In all analyzed databases, the data framework is static, with limited upgrade options and lacking normative databases for new modules. As a result, caution in OCT normality interpretation is warranted. To address these limitations, there is a need for more diverse, representative, and open-access datasets that take into account patient demographics, especially considering the development of supervised Machine Learning algorithms in healthcare.
first_indexed 2024-03-10T17:16:44Z
format Article
id doaj.art-ea8702d9f4b649b285623e9d9e8f9a41
institution Directory Open Access Journal
issn 2056-9920
language English
last_indexed 2024-03-10T17:16:44Z
publishDate 2023-08-01
publisher BMC
record_format Article
series International Journal of Retina and Vitreous
spelling doaj.art-ea8702d9f4b649b285623e9d9e8f9a412023-11-20T10:28:43ZengBMCInternational Journal of Retina and Vitreous2056-99202023-08-01911710.1186/s40942-023-00459-8Fairness and generalizability of OCT normative databases: a comparative analysisLuis Filipe Nakayama0Lucas Zago Ribeiro1Juliana Angelica Estevão de Oliveira2João Carlos Ramos Gonçalves de Matos3William Greig Mitchell4Fernando Korn Malerbi5Leo Anthony Celi6Caio Vinicius Saito Regatieri7Laboratory of Computational Physiology, Massachusetts Institute of TechnologyDepartment of Ophthalmology, São Paulo Federal UniversityDepartment of Ophthalmology, São Paulo Federal UniversityLaboratory of Computational Physiology, Massachusetts Institute of TechnologyDepartment of Ophthalmology, Royal Victorian Eye and Ear HospitalDepartment of Ophthalmology, São Paulo Federal UniversityLaboratory of Computational Physiology, Massachusetts Institute of TechnologyDepartment of Ophthalmology, São Paulo Federal UniversityAbstract Purpose In supervised Machine Learning algorithms, labels and reports are important in model development. To provide a normality assessment, the OCT has an in-built normative database that provides a color base scale from the measurement database comparison. This article aims to evaluate and compare normative databases of different OCT machines, analyzing patient demographic, contrast inclusion and exclusion criteria, diversity index, and statistical approach to assess their fairness and generalizability. Methods Data were retrieved from Cirrus, Avanti, Spectralis, and Triton’s FDA-approval and equipment manual. The following variables were compared: number of eyes and patients, inclusion and exclusion criteria, statistical approach, sex, race and ethnicity, age, participant country, and diversity index. Results Avanti OCT has the largest normative database (640 eyes). In every database, the inclusion and exclusion criteria were similar, including adult patients and excluding pathological eyes. Spectralis has the largest White (79.7%) proportionately representation, Cirrus has the largest Asian (24%), and Triton has the largest Black (22%) patient representation. In all databases, the statistical analysis applied was Regression models. The sex diversity index is similar in all datasets, and comparable to the ten most populous contries. Avanti dataset has the highest diversity index in terms of race, followed by Cirrus, Triton, and Spectralis. Conclusion In all analyzed databases, the data framework is static, with limited upgrade options and lacking normative databases for new modules. As a result, caution in OCT normality interpretation is warranted. To address these limitations, there is a need for more diverse, representative, and open-access datasets that take into account patient demographics, especially considering the development of supervised Machine Learning algorithms in healthcare.https://doi.org/10.1186/s40942-023-00459-8Optical coherence tomographyFairnessGeneralizabilityDatabaseSupervised machine learning
spellingShingle Luis Filipe Nakayama
Lucas Zago Ribeiro
Juliana Angelica Estevão de Oliveira
João Carlos Ramos Gonçalves de Matos
William Greig Mitchell
Fernando Korn Malerbi
Leo Anthony Celi
Caio Vinicius Saito Regatieri
Fairness and generalizability of OCT normative databases: a comparative analysis
International Journal of Retina and Vitreous
Optical coherence tomography
Fairness
Generalizability
Database
Supervised machine learning
title Fairness and generalizability of OCT normative databases: a comparative analysis
title_full Fairness and generalizability of OCT normative databases: a comparative analysis
title_fullStr Fairness and generalizability of OCT normative databases: a comparative analysis
title_full_unstemmed Fairness and generalizability of OCT normative databases: a comparative analysis
title_short Fairness and generalizability of OCT normative databases: a comparative analysis
title_sort fairness and generalizability of oct normative databases a comparative analysis
topic Optical coherence tomography
Fairness
Generalizability
Database
Supervised machine learning
url https://doi.org/10.1186/s40942-023-00459-8
work_keys_str_mv AT luisfilipenakayama fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis
AT lucaszagoribeiro fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis
AT julianaangelicaestevaodeoliveira fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis
AT joaocarlosramosgoncalvesdematos fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis
AT williamgreigmitchell fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis
AT fernandokornmalerbi fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis
AT leoanthonyceli fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis
AT caioviniciussaitoregatieri fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis