A scoping review of preprocessing methods for unstructured text data to assess data quality

Introduction Unstructured text data (UTD) are increasingly found in many databases that were never intended to be used for research, including electronic medical record (EMR) databases. Data quality can impact the usefulness of UTD for research. UTD are typically prepared for analysis (i.e., preproc...

Full description

Bibliographic Details
Main Authors: Marcello Nesca, Alan Katz, Carson Leung, Lisa Lix
Format: Article
Language:English
Published: Swansea University 2022-10-01
Series:International Journal of Population Data Science
Subjects:
Online Access:https://ijpds.org/article/view/1757

Similar Items