A scoping review of the landscape of health-related open datasets in Latin America.

Artificial intelligence (AI) algorithms have the potential to revolutionize healthcare, but their successful translation into clinical practice has been limited. One crucial factor is the data used to train these algorithms, which must be representative of the population. However, most healthcare da...

Full description

Bibliographic Details
Main Authors: David Restrepo, Justin Quion, Constanza Vásquez-Venegas, Cleva Villanueva, Leo Anthony Celi, Luis Filipe Nakayama
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2023-10-01
Series:PLOS Digital Health
Online Access:https://journals.plos.org/digitalhealth/article/file?id=10.1371/journal.pdig.0000368&type=printable
_version_ 1797643545340805120
author David Restrepo
Justin Quion
Constanza Vásquez-Venegas
Cleva Villanueva
Leo Anthony Celi
Luis Filipe Nakayama
author_facet David Restrepo
Justin Quion
Constanza Vásquez-Venegas
Cleva Villanueva
Leo Anthony Celi
Luis Filipe Nakayama
author_sort David Restrepo
collection DOAJ
description Artificial intelligence (AI) algorithms have the potential to revolutionize healthcare, but their successful translation into clinical practice has been limited. One crucial factor is the data used to train these algorithms, which must be representative of the population. However, most healthcare databases are derived from high-income countries, leading to non-representative models and potentially exacerbating health inequities. This review focuses on the landscape of health-related open datasets in Latin America, aiming to identify existing datasets, examine data-sharing frameworks, techniques, platforms, and formats, and identify best practices in Latin America. The review found 61 datasets from 23 countries, with the DATASUS dataset from Brazil contributing to the majority of articles. The analysis revealed a dearth of datasets created by the authors themselves, indicating a reliance on existing open datasets. The findings underscore the importance of promoting open data in Latin America. We provide recommendations for enhancing data sharing in the region.
first_indexed 2024-03-11T14:16:26Z
format Article
id doaj.art-a09a50a8c17f4a5f8ad8a9539b876ab3
institution Directory Open Access Journal
issn 2767-3170
language English
last_indexed 2024-03-11T14:16:26Z
publishDate 2023-10-01
publisher Public Library of Science (PLoS)
record_format Article
series PLOS Digital Health
spelling doaj.art-a09a50a8c17f4a5f8ad8a9539b876ab32023-11-01T05:32:12ZengPublic Library of Science (PLoS)PLOS Digital Health2767-31702023-10-01210e000036810.1371/journal.pdig.0000368A scoping review of the landscape of health-related open datasets in Latin America.David RestrepoJustin QuionConstanza Vásquez-VenegasCleva VillanuevaLeo Anthony CeliLuis Filipe NakayamaArtificial intelligence (AI) algorithms have the potential to revolutionize healthcare, but their successful translation into clinical practice has been limited. One crucial factor is the data used to train these algorithms, which must be representative of the population. However, most healthcare databases are derived from high-income countries, leading to non-representative models and potentially exacerbating health inequities. This review focuses on the landscape of health-related open datasets in Latin America, aiming to identify existing datasets, examine data-sharing frameworks, techniques, platforms, and formats, and identify best practices in Latin America. The review found 61 datasets from 23 countries, with the DATASUS dataset from Brazil contributing to the majority of articles. The analysis revealed a dearth of datasets created by the authors themselves, indicating a reliance on existing open datasets. The findings underscore the importance of promoting open data in Latin America. We provide recommendations for enhancing data sharing in the region.https://journals.plos.org/digitalhealth/article/file?id=10.1371/journal.pdig.0000368&type=printable
spellingShingle David Restrepo
Justin Quion
Constanza Vásquez-Venegas
Cleva Villanueva
Leo Anthony Celi
Luis Filipe Nakayama
A scoping review of the landscape of health-related open datasets in Latin America.
PLOS Digital Health
title A scoping review of the landscape of health-related open datasets in Latin America.
title_full A scoping review of the landscape of health-related open datasets in Latin America.
title_fullStr A scoping review of the landscape of health-related open datasets in Latin America.
title_full_unstemmed A scoping review of the landscape of health-related open datasets in Latin America.
title_short A scoping review of the landscape of health-related open datasets in Latin America.
title_sort scoping review of the landscape of health related open datasets in latin america
url https://journals.plos.org/digitalhealth/article/file?id=10.1371/journal.pdig.0000368&type=printable
work_keys_str_mv AT davidrestrepo ascopingreviewofthelandscapeofhealthrelatedopendatasetsinlatinamerica
AT justinquion ascopingreviewofthelandscapeofhealthrelatedopendatasetsinlatinamerica
AT constanzavasquezvenegas ascopingreviewofthelandscapeofhealthrelatedopendatasetsinlatinamerica
AT clevavillanueva ascopingreviewofthelandscapeofhealthrelatedopendatasetsinlatinamerica
AT leoanthonyceli ascopingreviewofthelandscapeofhealthrelatedopendatasetsinlatinamerica
AT luisfilipenakayama ascopingreviewofthelandscapeofhealthrelatedopendatasetsinlatinamerica
AT davidrestrepo scopingreviewofthelandscapeofhealthrelatedopendatasetsinlatinamerica
AT justinquion scopingreviewofthelandscapeofhealthrelatedopendatasetsinlatinamerica
AT constanzavasquezvenegas scopingreviewofthelandscapeofhealthrelatedopendatasetsinlatinamerica
AT clevavillanueva scopingreviewofthelandscapeofhealthrelatedopendatasetsinlatinamerica
AT leoanthonyceli scopingreviewofthelandscapeofhealthrelatedopendatasetsinlatinamerica
AT luisfilipenakayama scopingreviewofthelandscapeofhealthrelatedopendatasetsinlatinamerica