Clinical records anonymisation and text extraction (CRATE): an open-source software system

Abstract. Background: Electronic medical records contain information of value for research, but contain identifiable and often highly sensitive confidential information. Patient-identifiable information cannot in general be shared outside clinical care teams without explicit consent, but anonymisation...

Bibliographic Details
Main Author: Rudolf N. Cardinal
Format: Article
Language: English
Published: BMC 2017-04-01
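Series: BMC Medical Informatics and Decision Making
Subjects: Anonymisation; De-identification; Clinical informatics; Electronic medical records; Open-source software; Pseudonymisation
Online Access: http://link.springer.com/article/10.1186/s12911-017-0437-1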
author Rudolf N. Cardinal
collection DOAJ
description Abstract. Background: Electronic medical records contain information of value for research, but contain identifiable and often highly sensitive confidential information. Patient-identifiable information cannot in general be shared outside clinical care teams without explicit consent, but anonymisation/de-identification allows research uses of clinical data without explicit consent. Results: This article presents CRATE (Clinical Records Anonymisation and Text Extraction), an open-source software system with separable functions: (1) it anonymises or de-identifies arbitrary relational databases, with sensitivity and precision similar to previous comparable systems; (2) it uses public secure cryptographic methods to map patient identifiers to research identifiers (pseudonyms); (3) it connects relational databases to external tools for natural language processing; (4) it provides a web front end for research and administrative functions; and (5) it supports a specific model through which patients may consent to be contacted about research. Conclusions: Creation and management of a research database from sensitive clinical records with secure pseudonym generation, full-text indexing, and a consent-to-contact process is possible and practical using entirely free and open-source software.
first_indexed 2024-12-22T21:33:01Z
format Article
id doaj.art-3263fb23b6504cedb56d9ea757d38660
institution Directory Open Access Journal
issn 1472-6947
language English
last_indexed 2024-12-22T21:33:01Z
publishDate 2017-04-01
publisher BMC
record_format Article
series BMC Medical Informatics and Decision Making
spelling doaj.art-3263fb23b6504cedb56d9ea757d38660 | 2022-12-21T18:11:51Z | eng | BMC | BMC Medical Informatics and Decision Making | 1472-6947 | 2017-04-01 | 17 | 1 | 1 | 12 | 10.1186/s12911-017-0437-1 | Clinical records anonymisation and text extraction (CRATE): an open-source software system | Rudolf N. Cardinal | 0 | Behavioural and Clinical Neuroscience Institute, Department of Psychiatry, University of Cambridge | Abstract. Background: Electronic medical records contain information of value for research, but contain identifiable and often highly sensitive confidential information. Patient-identifiable information cannot in general be shared outside clinical care teams without explicit consent, but anonymisation/de-identification allows research uses of clinical data without explicit consent. Results: This article presents CRATE (Clinical Records Anonymisation and Text Extraction), an open-source software system with separable functions: (1) it anonymises or de-identifies arbitrary relational databases, with sensitivity and precision similar to previous comparable systems; (2) it uses public secure cryptographic methods to map patient identifiers to research identifiers (pseudonyms); (3) it connects relational databases to external tools for natural language processing; (4) it provides a web front end for research and administrative functions; and (5) it supports a specific model through which patients may consent to be contacted about research. Conclusions: Creation and management of a research database from sensitive clinical records with secure pseudonym generation, full-text indexing, and a consent-to-contact process is possible and practical using entirely free and open-source software. | http://link.springer.com/article/10.1186/s12911-017-0437-1 | Anonymisation | De-identification | Clinical informatics | Electronic medical records | Open-source software | Pseudonymisation
title Clinical records anonymisation and text extraction (CRATE): an open-source software system
title_sort clinical records anonymisation and text extraction crate an open source software system
topic Anonymisation
De-identification
Clinical informatics
Electronic medical records
Open-source software
Pseudonymisation
url http://link.springer.com/article/10.1186/s12911-017-0437-1