A scoping review of preprocessing methods for unstructured text data to assess data quality
Introduction Unstructured text data (UTD) are increasingly found in many databases that were never intended to be used for research, including electronic medical record (EMR) databases. Data quality can impact the usefulness of UTD for research. UTD are typically prepared for analysis (i.e., preproc...
Main Authors: | Marcello Nesca, Alan Katz, Carson Leung, Lisa Lix |
---|---|
Format: | Article |
Language: | English |
Published: |
Swansea University
2022-10-01
|
Series: | International Journal of Population Data Science |
Subjects: | |
Online Access: | https://ijpds.org/article/view/1757 |
Similar Items
-
Unstructured Data Analysis for Risk Management of Electric Power Transmission Lines
by: Lucas H. Pereira, et al.
Published: (2022-05-01) -
Text to Causal Knowledge Graph: A Framework to Synthesize Knowledge from Unstructured Business Texts into Causal Graphs
by: Seethalakshmi Gopalakrishnan, et al.
Published: (2023-06-01) -
Usability enhancement model for unstructured text in big data
by: Kiran Adnan, et al.
Published: (2023-11-01) -
Connecting Text Classification with Image Classification: A New Preprocessing Method for Implicit Sentiment Text Classification
by: Meikang Chen, et al.
Published: (2022-02-01) -
Pan-Canadian Electronic Medical Record Diagnostic and Unstructured Text Data for Capturing PTSD: Retrospective Observational Study
by: Leanne Kosowan, et al.
Published: (2022-12-01)