Fairness and generalizability of OCT normative databases: a comparative analysis

Abstract Purpose In supervised Machine Learning algorithms, labels and reports are important in model development. To provide a normality assessment, the OCT has an in-built normative database that provides a color base scale from the measurement dat...

Full description

Bibliographic Details
Main Authors: Nakayama, Luis F., Zago Ribeiro, Lucas, de Oliveira, Juliana A. E., de Matos, João C. R. G., Mitchell, William G., Malerbi, Fernando K., Celi, Leo A., Regatieri, Caio V. S.
Other Authors: Harvard--MIT Program in Health Sciences and Technology. Laboratory for Computational Physiology
Format: Article
Language:English
Published: BioMed Central 2023
Online Access:https://hdl.handle.net/1721.1/152267
_version_ 1826216575930204160
author Nakayama, Luis F.
Zago Ribeiro, Lucas
de Oliveira, Juliana A. E.
de Matos, João C. R. G.
Mitchell, William G.
Malerbi, Fernando K.
Celi, Leo A.
Regatieri, Caio V. S.
author2 Harvard--MIT Program in Health Sciences and Technology. Laboratory for Computational Physiology
author_facet Harvard--MIT Program in Health Sciences and Technology. Laboratory for Computational Physiology
Nakayama, Luis F.
Zago Ribeiro, Lucas
de Oliveira, Juliana A. E.
de Matos, João C. R. G.
Mitchell, William G.
Malerbi, Fernando K.
Celi, Leo A.
Regatieri, Caio V. S.
author_sort Nakayama, Luis F.
collection MIT
description Abstract Purpose In supervised Machine Learning algorithms, labels and reports are important in model development. To provide a normality assessment, the OCT has an in-built normative database that provides a color base scale from the measurement database comparison. This article aims to evaluate and compare normative databases of different OCT machines, analyzing patient demographic, contrast inclusion and exclusion criteria, diversity index, and statistical approach to assess their fairness and generalizability. Methods Data were retrieved from Cirrus, Avanti, Spectralis, and Triton’s FDA-approval and equipment manual. The following variables were compared: number of eyes and patients, inclusion and exclusion criteria, statistical approach, sex, race and ethnicity, age, participant country, and diversity index. Results Avanti OCT has the largest normative database (640 eyes). In every database, the inclusion and exclusion criteria were similar, including adult patients and excluding pathological eyes. Spectralis has the largest White (79.7%) proportionately representation, Cirrus has the largest Asian (24%), and Triton has the largest Black (22%) patient representation. In all databases, the statistical analysis applied was Regression models. The sex diversity index is similar in all datasets, and comparable to the ten most populous contries. Avanti dataset has the highest diversity index in terms of race, followed by Cirrus, Triton, and Spectralis. Conclusion In all analyzed databases, the data framework is static, with limited upgrade options and lacking normative databases for new modules. As a result, caution in OCT normality interpretation is warranted. To address these limitations, there is a need for more diverse, representative, and open-access datasets that take into account patient demographics, especially considering the development of supervised Machine Learning algorithms in healthcare.
first_indexed 2024-09-23T16:49:29Z
format Article
id mit-1721.1/152267
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T16:49:29Z
publishDate 2023
publisher BioMed Central
record_format dspace
spelling mit-1721.1/1522672024-01-10T18:12:56Z Fairness and generalizability of OCT normative databases: a comparative analysis Nakayama, Luis F. Zago Ribeiro, Lucas de Oliveira, Juliana A. E. de Matos, João C. R. G. Mitchell, William G. Malerbi, Fernando K. Celi, Leo A. Regatieri, Caio V. S. Harvard--MIT Program in Health Sciences and Technology. Laboratory for Computational Physiology Abstract Purpose In supervised Machine Learning algorithms, labels and reports are important in model development. To provide a normality assessment, the OCT has an in-built normative database that provides a color base scale from the measurement database comparison. This article aims to evaluate and compare normative databases of different OCT machines, analyzing patient demographic, contrast inclusion and exclusion criteria, diversity index, and statistical approach to assess their fairness and generalizability. Methods Data were retrieved from Cirrus, Avanti, Spectralis, and Triton’s FDA-approval and equipment manual. The following variables were compared: number of eyes and patients, inclusion and exclusion criteria, statistical approach, sex, race and ethnicity, age, participant country, and diversity index. Results Avanti OCT has the largest normative database (640 eyes). In every database, the inclusion and exclusion criteria were similar, including adult patients and excluding pathological eyes. Spectralis has the largest White (79.7%) proportionately representation, Cirrus has the largest Asian (24%), and Triton has the largest Black (22%) patient representation. In all databases, the statistical analysis applied was Regression models. The sex diversity index is similar in all datasets, and comparable to the ten most populous contries. Avanti dataset has the highest diversity index in terms of race, followed by Cirrus, Triton, and Spectralis. Conclusion In all analyzed databases, the data framework is static, with limited upgrade options and lacking normative databases for new modules. As a result, caution in OCT normality interpretation is warranted. To address these limitations, there is a need for more diverse, representative, and open-access datasets that take into account patient demographics, especially considering the development of supervised Machine Learning algorithms in healthcare. 2023-09-27T18:25:04Z 2023-09-27T18:25:04Z 2023-08-21 2023-08-27T03:12:04Z Article http://purl.org/eprint/type/JournalArticle https://hdl.handle.net/1721.1/152267 International Journal of Retina and Vitreous. 2023 Aug 21;9(1):48 PUBLISHER_CC en https://doi.org/10.1186/s40942-023-00459-8 Creative Commons Attribution https://creativecommons.org/licenses/by/4.0/ Brazilian Retina and Vitreous Society application/pdf BioMed Central BioMed Central
spellingShingle Nakayama, Luis F.
Zago Ribeiro, Lucas
de Oliveira, Juliana A. E.
de Matos, João C. R. G.
Mitchell, William G.
Malerbi, Fernando K.
Celi, Leo A.
Regatieri, Caio V. S.
Fairness and generalizability of OCT normative databases: a comparative analysis
title Fairness and generalizability of OCT normative databases: a comparative analysis
title_full Fairness and generalizability of OCT normative databases: a comparative analysis
title_fullStr Fairness and generalizability of OCT normative databases: a comparative analysis
title_full_unstemmed Fairness and generalizability of OCT normative databases: a comparative analysis
title_short Fairness and generalizability of OCT normative databases: a comparative analysis
title_sort fairness and generalizability of oct normative databases a comparative analysis
url https://hdl.handle.net/1721.1/152267
work_keys_str_mv AT nakayamaluisf fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis
AT zagoribeirolucas fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis
AT deoliveirajulianaae fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis
AT dematosjoaocrg fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis
AT mitchellwilliamg fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis
AT malerbifernandok fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis
AT celileoa fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis
AT regatiericaiovs fairnessandgeneralizabilityofoctnormativedatabasesacomparativeanalysis