Federated Learning for cross-jurisdictional analyses: A case study.

The objective of this project is to implement a harmonized artificial intelligence (AI)-based de-identification of free-text medical data across multiple Canadian jurisdictions. This federated learning approach will allow these jurisdictions to leverage each other’s data and resources while no indi...

Full description

Bibliographic Details
Main Authors: Mahmoud Azimaee, Lisa M. Lix
Format: Article
Language:English
Published: Swansea University 2022-08-01
Series:International Journal of Population Data Science
Subjects:
Online Access:https://ijpds.org/article/view/2026
_version_ 1797428812693110784
author Mahmoud Azimaee
Lisa M. Lix
author_facet Mahmoud Azimaee
Lisa M. Lix
author_sort Mahmoud Azimaee
collection DOAJ
description The objective of this project is to implement a harmonized artificial intelligence (AI)-based de-identification of free-text medical data across multiple Canadian jurisdictions. This federated learning approach will allow these jurisdictions to leverage each other’s data and resources while no individual-level data leaves the jurisdiction. Federated Learning enables health data centers in different jurisdictions to collaborate in training machine learning models without sharing individual-level data. This approach will significantly reduce privacy and cybersecurity risks and barriers that are involved in sharing and moving data across different jurisdictions. In a federated learning environment, machine learning models are trained on multiple data sources available in local data centers; local data are not shared to a central computing/analysis environment. Instead, parameters (such as model weights) are shared between these local data centers to generate a global model that will be shared and used by all participating data centers. In this case study, four health research data centers in different Canadian provinces will take part in deployment of an AI-based application for de-identification of free-text data. The data centers are members of Health Data Research Network (HDRN) Canada.  The deployment will include: • harmonized annotation and labeling of local data, • local training of entity recognition algorithms, • integrating model weights from each data centers to create a global model • development of license agreements between the participating data centers to allow sharing model weights This is an ongoing project. The talk will demonstrate learning experiences, advantages, and challenges in a federated learning environment and explore the feasibility of transporting this approach to other multi-jurisdiction research networks.
first_indexed 2024-03-09T09:04:13Z
format Article
id doaj.art-83e4697427a94ac5a94efad124a9b7a1
institution Directory Open Access Journal
issn 2399-4908
language English
last_indexed 2024-03-09T09:04:13Z
publishDate 2022-08-01
publisher Swansea University
record_format Article
series International Journal of Population Data Science
spelling doaj.art-83e4697427a94ac5a94efad124a9b7a12023-12-02T10:46:58ZengSwansea UniversityInternational Journal of Population Data Science2399-49082022-08-017310.23889/ijpds.v7i3.2026Federated Learning for cross-jurisdictional analyses: A case study.Mahmoud Azimaee0Lisa M. Lix1Institute for Clinical Evaluation Sciences (ICES)University of Manitoba The objective of this project is to implement a harmonized artificial intelligence (AI)-based de-identification of free-text medical data across multiple Canadian jurisdictions. This federated learning approach will allow these jurisdictions to leverage each other’s data and resources while no individual-level data leaves the jurisdiction. Federated Learning enables health data centers in different jurisdictions to collaborate in training machine learning models without sharing individual-level data. This approach will significantly reduce privacy and cybersecurity risks and barriers that are involved in sharing and moving data across different jurisdictions. In a federated learning environment, machine learning models are trained on multiple data sources available in local data centers; local data are not shared to a central computing/analysis environment. Instead, parameters (such as model weights) are shared between these local data centers to generate a global model that will be shared and used by all participating data centers. In this case study, four health research data centers in different Canadian provinces will take part in deployment of an AI-based application for de-identification of free-text data. The data centers are members of Health Data Research Network (HDRN) Canada.  The deployment will include: • harmonized annotation and labeling of local data, • local training of entity recognition algorithms, • integrating model weights from each data centers to create a global model • development of license agreements between the participating data centers to allow sharing model weights This is an ongoing project. The talk will demonstrate learning experiences, advantages, and challenges in a federated learning environment and explore the feasibility of transporting this approach to other multi-jurisdiction research networks. https://ijpds.org/article/view/2026Federated LearningArtificial IntelligenceMachine LearningDe-identificationcross-jurisdictional
spellingShingle Mahmoud Azimaee
Lisa M. Lix
Federated Learning for cross-jurisdictional analyses: A case study.
International Journal of Population Data Science
Federated Learning
Artificial Intelligence
Machine Learning
De-identification
cross-jurisdictional
title Federated Learning for cross-jurisdictional analyses: A case study.
title_full Federated Learning for cross-jurisdictional analyses: A case study.
title_fullStr Federated Learning for cross-jurisdictional analyses: A case study.
title_full_unstemmed Federated Learning for cross-jurisdictional analyses: A case study.
title_short Federated Learning for cross-jurisdictional analyses: A case study.
title_sort federated learning for cross jurisdictional analyses a case study
topic Federated Learning
Artificial Intelligence
Machine Learning
De-identification
cross-jurisdictional
url https://ijpds.org/article/view/2026
work_keys_str_mv AT mahmoudazimaee federatedlearningforcrossjurisdictionalanalysesacasestudy
AT lisamlix federatedlearningforcrossjurisdictionalanalysesacasestudy