Automated de-identification of free-text medical records

Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006.

Bibliographic Details
Main Author:	Neamatullah, Ishna
Other Authors:	Roger G. Mark.
Format:	Thesis
Language:	eng
Published:	Massachusetts Institute of Technology 2008
Subjects:	Electrical Engineering and Computer Science.
Online Access:	http://hdl.handle.net/1721.1/41622

_version_	1826217469263478784
author	Neamatullah, Ishna
author2	Roger G. Mark.
author_facet	Roger G. Mark. Neamatullah, Ishna
author_sort	Neamatullah, Ishna
collection	MIT
description	Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006.
first_indexed	2024-09-23T17:04:09Z
format	Thesis
id	mit-1721.1/41622
institution	Massachusetts Institute of Technology
language	eng
last_indexed	2024-09-23T17:04:09Z
publishDate	2008
publisher	Massachusetts Institute of Technology
record_format	dspace
spelling	mit-1721.1/416222019-04-12T16:07:53Z Automated de-identification of free-text medical records Neamatullah, Ishna Roger G. Mark. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006. Includes bibliographical references (p. 62-64). This paper presents a de-identification study at the Harvard-MIT Division of Health Science and Technology (HST) to automatically de-identify confidential patient information from text medical records used in intensive care units (ICUs). Patient records are a vital resource in medical research. Before such records can be made available for research studies, protected health information (PHI) must be thoroughly scrubbed according to HIPAA specifications to preserve patient confidentiality. Manual de-identification on large databases tends to be prohibitively expensive, time-consuming and prone to error, making a computerized algorithm an urgent need for large-scale de-identification purposes. We have developed an automated pattern-matching deidentification algorithm that uses medical and hospital-specific information. The current version of the algorithm has an overall sensitivity of around 0.87 and an approximate positive predictive value of 0.63. In terms of sensitivity, it performs significantly better than 1 person (0.81) but not quite as well as a consensus of 2 human de-identifiers (0.94). The algorithm will be published as open-source software, and the de-identified medical records will be incorporated into HST's Multi-Parameter Intelligent Monitoring for Intensive Care (MIMIC II) physiologic database. by Ishna Neamatullah. M.Eng. 2008-05-19T16:02:04Z 2008-05-19T16:02:04Z 2006 2006 Thesis http://hdl.handle.net/1721.1/41622 216883891 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 73 p. application/pdf Massachusetts Institute of Technology
spellingShingle	Electrical Engineering and Computer Science. Neamatullah, Ishna Automated de-identification of free-text medical records
title	Automated de-identification of free-text medical records
title_full	Automated de-identification of free-text medical records
title_fullStr	Automated de-identification of free-text medical records
title_full_unstemmed	Automated de-identification of free-text medical records
title_short	Automated de-identification of free-text medical records
title_sort	automated de identification of free text medical records
topic	Electrical Engineering and Computer Science.
url	http://hdl.handle.net/1721.1/41622
work_keys_str_mv	AT neamatullahishna automateddeidentificationoffreetextmedicalrecords

Automated de-identification of free-text medical records

Similar Items