Healthcare related event prediction from textual data with machine learning: A Systematic Literature Review

In the field of healthcare, as well as many others, textual descriptions of events are logged. With the use of Natural Language Processing (NLP), these texts are used to train event prediction machine learning algorithms. In this review the aim was to assess the state-of-the-art within current liter...

Full description

Bibliographic Details
Main Authors: Oscar Hoekstra, William Hurst, Joep Tummers
Format: Article
Language:English
Published: Elsevier 2022-11-01
Series:Healthcare Analytics
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2772442522000478
_version_ 1811201493574877184
author Oscar Hoekstra
William Hurst
Joep Tummers
author_facet Oscar Hoekstra
William Hurst
Joep Tummers
author_sort Oscar Hoekstra
collection DOAJ
description In the field of healthcare, as well as many others, textual descriptions of events are logged. With the use of Natural Language Processing (NLP), these texts are used to train event prediction machine learning algorithms. In this review the aim was to assess the state-of-the-art within current literature concerning prediction of events on textual records. Thus, this study follows a standard Systematic Literature Review (SLR) process. Primary articles are selected from PubMed, IEEE and WebOfScience with a search query, and then exclusion and quality assessment criteria are used to select the articles that are relevant to this study. Published performance metrics for the prediction algorithms used in the studies were then extracted from the included articles and used to assess the different methods. The general-purpose neural network algorithms: Convolutional Neural Networks (CNNs), Long Short-Term Memory (LSTM) and Conditional Random Fields (CRF) demonstrate the highest F1-scores amongst all the methods in this review, of 98.5%, 98% and 90.13% respectively. The algorithms that were designed specifically for NLP such as word2vec and BERT also performed well with F1-scores of 88.93% and 91.50%. This review does not give a comparison between methods but gives an indication about which machine learning methods perform well according to the authors of the selected studies. Not enough performance results are published under comparable circumstances to give conclusive results about which methods perform the best. More research needs to be done in comparing algorithms on the same dataset to proof the performance of the methods.
first_indexed 2024-04-12T02:22:34Z
format Article
id doaj.art-a743c57272c44cca954885310120e944
institution Directory Open Access Journal
issn 2772-4425
language English
last_indexed 2024-04-12T02:22:34Z
publishDate 2022-11-01
publisher Elsevier
record_format Article
series Healthcare Analytics
spelling doaj.art-a743c57272c44cca954885310120e9442022-12-22T03:52:05ZengElsevierHealthcare Analytics2772-44252022-11-012100107Healthcare related event prediction from textual data with machine learning: A Systematic Literature ReviewOscar Hoekstra0William Hurst1Joep Tummers2Information Technology Group, Wageningen University & Research, Hollandseweg 1, 6706 KN Wageningen, The NetherlandsCorresponding author.; Information Technology Group, Wageningen University & Research, Hollandseweg 1, 6706 KN Wageningen, The NetherlandsInformation Technology Group, Wageningen University & Research, Hollandseweg 1, 6706 KN Wageningen, The NetherlandsIn the field of healthcare, as well as many others, textual descriptions of events are logged. With the use of Natural Language Processing (NLP), these texts are used to train event prediction machine learning algorithms. In this review the aim was to assess the state-of-the-art within current literature concerning prediction of events on textual records. Thus, this study follows a standard Systematic Literature Review (SLR) process. Primary articles are selected from PubMed, IEEE and WebOfScience with a search query, and then exclusion and quality assessment criteria are used to select the articles that are relevant to this study. Published performance metrics for the prediction algorithms used in the studies were then extracted from the included articles and used to assess the different methods. The general-purpose neural network algorithms: Convolutional Neural Networks (CNNs), Long Short-Term Memory (LSTM) and Conditional Random Fields (CRF) demonstrate the highest F1-scores amongst all the methods in this review, of 98.5%, 98% and 90.13% respectively. The algorithms that were designed specifically for NLP such as word2vec and BERT also performed well with F1-scores of 88.93% and 91.50%. This review does not give a comparison between methods but gives an indication about which machine learning methods perform well according to the authors of the selected studies. Not enough performance results are published under comparable circumstances to give conclusive results about which methods perform the best. More research needs to be done in comparing algorithms on the same dataset to proof the performance of the methods.http://www.sciencedirect.com/science/article/pii/S2772442522000478Systematic literature reviewEvent predictionNatural Language ProcessingTextual dataMachine LearningHealthcare
spellingShingle Oscar Hoekstra
William Hurst
Joep Tummers
Healthcare related event prediction from textual data with machine learning: A Systematic Literature Review
Healthcare Analytics
Systematic literature review
Event prediction
Natural Language Processing
Textual data
Machine Learning
Healthcare
title Healthcare related event prediction from textual data with machine learning: A Systematic Literature Review
title_full Healthcare related event prediction from textual data with machine learning: A Systematic Literature Review
title_fullStr Healthcare related event prediction from textual data with machine learning: A Systematic Literature Review
title_full_unstemmed Healthcare related event prediction from textual data with machine learning: A Systematic Literature Review
title_short Healthcare related event prediction from textual data with machine learning: A Systematic Literature Review
title_sort healthcare related event prediction from textual data with machine learning a systematic literature review
topic Systematic literature review
Event prediction
Natural Language Processing
Textual data
Machine Learning
Healthcare
url http://www.sciencedirect.com/science/article/pii/S2772442522000478
work_keys_str_mv AT oscarhoekstra healthcarerelatedeventpredictionfromtextualdatawithmachinelearningasystematicliteraturereview
AT williamhurst healthcarerelatedeventpredictionfromtextualdatawithmachinelearningasystematicliteraturereview
AT joeptummers healthcarerelatedeventpredictionfromtextualdatawithmachinelearningasystematicliteraturereview