Generating Synthetic Training Data for Supervised De-Identification of Electronic Health Records

Generating Synthetic Training Data for Supervised De-Identification of Electronic Health Records

A major hurdle in the development of natural language processing (NLP) methods for Electronic Health Records (EHRs) is the lack of large, annotated datasets. Privacy concerns prevent the distribution of EHRs, and the annotation of data is known to be costly and cumbersome. Synthetic data presents a...

Full description

Bibliographic Details
Main Authors:	Claudia Alessandra Libbi, Jan Trienes, Dolf Trieschnigg, Christin Seifert
Format:	Article
Language:	English
Published:	MDPI AG 2021-05-01
Series:	Future Internet
Subjects:	natural language processing medical records privacy protection synthetic text generative language models named-entity recognition
Online Access:	https://www.mdpi.com/1999-5903/13/5/136

Similar Items

NERSkill.Id: Annotated dataset of Indonesian's skill entity recognition
by: Meilany Nonsi Tentua, et al.
Published: (2024-04-01)

Named Entity Recognition Utilized to Enhance Text Classification While Preserving Privacy
by: Mohammed Kutbi
Published: (2023-01-01)

Deep learning with language models improves named entity recognition for PharmaCoNER
by: Cong Sun, et al.
Published: (2021-12-01)

Natural Language Processing to Extract Information from Portuguese-Language Medical Records
by: Naila Camila da Rocha, et al.
Published: (2022-12-01)

Thai Named Entity Recognition Using BiLSTM-CNN-CRF Enhanced by TCC
by: Virach Sornlertlamvanich, et al.
Published: (2022-01-01)

Entity Linking Method for Chinese Short Text Based on Siamese-Like Network
by: Yang Zhang, et al.
Published: (2022-08-01)

Leveraging the potential of synthetic text for AI in mental healthcare
by: Julia Ive
Published: (2022-10-01)

A Survey on Recent Named Entity Recognition and Relationship Extraction Techniques on Clinical Texts
by: Priyankar Bose, et al.
Published: (2021-09-01)

GWBNER: A named entity recognition method based on character glyph and word boundary features for Chinese EHRs
by: Jinsong Zhang, et al.
Published: (2023-09-01)

Exploring named entity recognition and relation extraction for ontology and medical records integration
by: Diego Pinheiro da Silva, et al.
Published: (2023-01-01)

Methods of extracting biomedical information from patents and scientific publications (on the example of chemical compounds)
by: Nikolay A. Kolpakov, et al.
Published: (2023-03-01)

DarNERcorp: An annotated named entity recognition dataset in the Moroccan dialect
by: Hanane Nour Moussa, et al.
Published: (2023-06-01)

Systematic Literature Review of Information Extraction From Textual Data: Recent Methods, Applications, Trends, and Challenges
by: Mohd Hafizul Afifi Abdullah, et al.
Published: (2023-01-01)

An RG-FLAT-CRF Model for Named Entity Recognition of Chinese Electronic Clinical Records
by: Jiakang Li, et al.
Published: (2022-04-01)

MetaboListem and TABoLiSTM: Two Deep Learning Algorithms for Metabolite Named Entity Recognition
by: Cheng S. Yeung, et al.
Published: (2022-03-01)

Improving spaCy dependency annotation and PoS tagging web service using independent NER services
by: Nico Colic, et al.
Published: (2019-06-01)

News Image-Text Matching With News Knowledge Graph
by: Zhao Yumeng, et al.
Published: (2021-01-01)

Named Entity Recognition for Sensitive Data Discovery in Portuguese
by: Mariana Dias, et al.
Published: (2020-03-01)

Web Interface of NER and RE with BERT for Biomedical Text Mining
by: Yeon-Ji Park, et al.
Published: (2023-04-01)

Robust Chinese Named Entity Recognition Based on Fusion Graph Embedding
by: Xuhui Song, et al.
Published: (2023-01-01)

Parallel-Based Corpus Annotation for Malay Health Documents
by: Hafsah, et al.
Published: (2023-12-01)

On the Use of Parsing for Named Entity Recognition
by: Miguel A. Alonso, et al.
Published: (2021-01-01)

Learning the Morphological and Syntactic Grammars for Named Entity Recognition
by: Mengtao Sun, et al.
Published: (2022-01-01)

MLM-based typographical error correction of unstructured medical texts for named entity recognition
by: Eun Byul Lee, et al.
Published: (2022-11-01)

SMPT: A Semi-Supervised Multi-Model Prediction Technique for Food Ingredient Named Entity Recognition (FINER) Dataset Construction
by: Kokoy Siti Komariah, et al.
Published: (2023-01-01)

Adaptive Geoparsing Method for Toponym Recognition and Resolution in Unstructured Text
by: Edwin Aldana-Bobadilla, et al.
Published: (2020-09-01)

Chinese medical entity recognition based on the dual-branch TENER model
by: Hui Peng, et al.
Published: (2023-07-01)

PNER: Applying the Pipeline Method to Resolve Nested Issues in Named Entity Recognition
by: Hongjian Yang, et al.
Published: (2024-02-01)

Clinical concept recognition: Evaluation of existing systems on EHRs
by: Juan Antonio Lossio-Ventura, et al.
Published: (2023-01-01)

GPDminer: a tool for extracting named entities and analyzing relations in biological literature
by: Yeon-Ji Park, et al.
Published: (2024-03-01)

Ontology Attention Layer for Medical Named Entity Recognition
by: Yue Zha, et al.
Published: (2024-01-01)

A BERT-Span model for Chinese named entity recognition in rehabilitation medicine
by: Jinhong Zhong, et al.
Published: (2023-08-01)

Chinese Fine‐Grained Geological Named Entity Recognition With Rules and FLAT
by: Siying Chen, et al.
Published: (2022-12-01)

Fusion of SoftLexicon and RoBERTa for Purpose-Driven Electronic Medical Record Named Entity Recognition
by: Xiaohui Cui, et al.
Published: (2023-12-01)

Concept recognition as a machine translation problem
by: Mayla R. Boguslav, et al.
Published: (2021-12-01)

Named entity recognition of Chinese electronic medical records based on a hybrid neural network and medical MC-BERT
by: Peng Chen, et al.
Published: (2022-12-01)

Named entity recognition method based on joint entity boundary detection
by: Xiaoteng LI, et al.
Published: (2023-02-01)

Named Entity Recognition Datasets: A Classification Framework
by: Ying Zhang, et al.
Published: (2024-03-01)

An ensemble deep learning model to enhance feature representation for entity detection
by: Elham Parsaeimehr, et al.
Published: (2022-06-01)

Metamorphic testing of named entity recognition systems: A case study
by: Yezi Xu, et al.
Published: (2022-08-01)