Record linkage based patient intersection cardinality for rare disease studies using Mainzelliste and secure multi-party computation

Abstract Background The low number of patients suffering from any given rare diseases poses a difficult problem for medical research: With the exception of some specialized biobanks and disease registries, potential study participants’ information are disjoint and distributed over many medical insti...

Full description

Bibliographic Details
Main Authors: Tobias Kussel, Torben Brenner, Galina Tremper, Josef Schepers, Martin Lablans, Kay Hamacher
Format: Article
Language:English
Published: BMC 2022-10-01
Series:Journal of Translational Medicine
Subjects:
Online Access:https://doi.org/10.1186/s12967-022-03671-6
_version_ 1817979672403640320
author Tobias Kussel
Torben Brenner
Galina Tremper
Josef Schepers
Martin Lablans
Kay Hamacher
author_facet Tobias Kussel
Torben Brenner
Galina Tremper
Josef Schepers
Martin Lablans
Kay Hamacher
author_sort Tobias Kussel
collection DOAJ
description Abstract Background The low number of patients suffering from any given rare diseases poses a difficult problem for medical research: With the exception of some specialized biobanks and disease registries, potential study participants’ information are disjoint and distributed over many medical institutions. Whenever some of those facilities are in close proximity, a significant overlap of patients can reasonably be expected, further complicating statistical study feasibility assessments and data gathering. Due to the sensitive nature of medical records and identifying data, data transfer and joint computations are often forbidden by law or associated with prohibitive amounts of effort. To alleviate this problem and to support rare disease research, we developed the Mainzelliste Secure EpiLinker (MainSEL) record linkage framework, a secure Multi-Party Computation based application using trusted-third-party-less cryptographic protocols to perform privacy-preserving record linkage with high security guarantees. In this work, we extend MainSEL to allow the record linkage based calculation of the number of common patients between institutions. This allows privacy-preserving statistical feasibility estimations for further analyses and data consolidation. Additionally, we created easy to deploy software packages using microservice containerization and continuous deployment/continuous integration. We performed tests with medical researchers using MainSEL in real-world medical IT environments, using synthetic patient data. Results We show that MainSEL achieves practical runtimes, performing 10 000 comparisons in approximately 5 minutes. Our approach proved to be feasible in a wide range of network settings and use cases. The “lessons learned” from the real-world testing show the need to explicitly support and document the usage and deployment for both analysis pipeline integration and researcher driven ad-hoc analysis use cases, thus clarifying the wide applicability of our software. MainSEL is freely available under: https://github.com/medicalinformatics/MainSEL Conclusions MainSEL performs well in real-world settings and is a useful tool not only for rare disease research, but medical research in general. It achieves practical runtimes, improved security guarantees compared to existing solutions, and is simple to deploy in strict clinical IT environments. Based on the “lessons learned” from the real-word testing, we hope to enable a wide range of medical researchers to meet their needs and requirements using modern privacy-preserving technologies.
first_indexed 2024-04-13T22:45:34Z
format Article
id doaj.art-6f592381ecb847c3bd2b77e2d17be5de
institution Directory Open Access Journal
issn 1479-5876
language English
last_indexed 2024-04-13T22:45:34Z
publishDate 2022-10-01
publisher BMC
record_format Article
series Journal of Translational Medicine
spelling doaj.art-6f592381ecb847c3bd2b77e2d17be5de2022-12-22T02:26:24ZengBMCJournal of Translational Medicine1479-58762022-10-0120111410.1186/s12967-022-03671-6Record linkage based patient intersection cardinality for rare disease studies using Mainzelliste and secure multi-party computationTobias Kussel0Torben Brenner1Galina Tremper2Josef Schepers3Martin Lablans4Kay Hamacher5Technische Universität DarmstadtGerman Cancer Research CenterGerman Cancer Research CenterBerlin Institute of HealthGerman Cancer Research CenterTechnische Universität DarmstadtAbstract Background The low number of patients suffering from any given rare diseases poses a difficult problem for medical research: With the exception of some specialized biobanks and disease registries, potential study participants’ information are disjoint and distributed over many medical institutions. Whenever some of those facilities are in close proximity, a significant overlap of patients can reasonably be expected, further complicating statistical study feasibility assessments and data gathering. Due to the sensitive nature of medical records and identifying data, data transfer and joint computations are often forbidden by law or associated with prohibitive amounts of effort. To alleviate this problem and to support rare disease research, we developed the Mainzelliste Secure EpiLinker (MainSEL) record linkage framework, a secure Multi-Party Computation based application using trusted-third-party-less cryptographic protocols to perform privacy-preserving record linkage with high security guarantees. In this work, we extend MainSEL to allow the record linkage based calculation of the number of common patients between institutions. This allows privacy-preserving statistical feasibility estimations for further analyses and data consolidation. Additionally, we created easy to deploy software packages using microservice containerization and continuous deployment/continuous integration. We performed tests with medical researchers using MainSEL in real-world medical IT environments, using synthetic patient data. Results We show that MainSEL achieves practical runtimes, performing 10 000 comparisons in approximately 5 minutes. Our approach proved to be feasible in a wide range of network settings and use cases. The “lessons learned” from the real-world testing show the need to explicitly support and document the usage and deployment for both analysis pipeline integration and researcher driven ad-hoc analysis use cases, thus clarifying the wide applicability of our software. MainSEL is freely available under: https://github.com/medicalinformatics/MainSEL Conclusions MainSEL performs well in real-world settings and is a useful tool not only for rare disease research, but medical research in general. It achieves practical runtimes, improved security guarantees compared to existing solutions, and is simple to deploy in strict clinical IT environments. Based on the “lessons learned” from the real-word testing, we hope to enable a wide range of medical researchers to meet their needs and requirements using modern privacy-preserving technologies.https://doi.org/10.1186/s12967-022-03671-6Multi-party computationRare diseaseIntersection cardinalityRecord linkageMedical informatics
spellingShingle Tobias Kussel
Torben Brenner
Galina Tremper
Josef Schepers
Martin Lablans
Kay Hamacher
Record linkage based patient intersection cardinality for rare disease studies using Mainzelliste and secure multi-party computation
Journal of Translational Medicine
Multi-party computation
Rare disease
Intersection cardinality
Record linkage
Medical informatics
title Record linkage based patient intersection cardinality for rare disease studies using Mainzelliste and secure multi-party computation
title_full Record linkage based patient intersection cardinality for rare disease studies using Mainzelliste and secure multi-party computation
title_fullStr Record linkage based patient intersection cardinality for rare disease studies using Mainzelliste and secure multi-party computation
title_full_unstemmed Record linkage based patient intersection cardinality for rare disease studies using Mainzelliste and secure multi-party computation
title_short Record linkage based patient intersection cardinality for rare disease studies using Mainzelliste and secure multi-party computation
title_sort record linkage based patient intersection cardinality for rare disease studies using mainzelliste and secure multi party computation
topic Multi-party computation
Rare disease
Intersection cardinality
Record linkage
Medical informatics
url https://doi.org/10.1186/s12967-022-03671-6
work_keys_str_mv AT tobiaskussel recordlinkagebasedpatientintersectioncardinalityforrarediseasestudiesusingmainzellisteandsecuremultipartycomputation
AT torbenbrenner recordlinkagebasedpatientintersectioncardinalityforrarediseasestudiesusingmainzellisteandsecuremultipartycomputation
AT galinatremper recordlinkagebasedpatientintersectioncardinalityforrarediseasestudiesusingmainzellisteandsecuremultipartycomputation
AT josefschepers recordlinkagebasedpatientintersectioncardinalityforrarediseasestudiesusingmainzellisteandsecuremultipartycomputation
AT martinlablans recordlinkagebasedpatientintersectioncardinalityforrarediseasestudiesusingmainzellisteandsecuremultipartycomputation
AT kayhamacher recordlinkagebasedpatientintersectioncardinalityforrarediseasestudiesusingmainzellisteandsecuremultipartycomputation