System for Semi-Automated Literature Review Based on Machine Learning

This paper presents the design and implementation of a system for semi-automating the literature review process based on machine learning. By using machine learning algorithms, the system determines whether scientific papers belong to the topic that is being explored as part of the review process. T...

Full description

Bibliographic Details
Main Authors: Filip Bacinger, Ivica Boticki, Danijel Mlinaric
Format: Article
Language:English
Published: MDPI AG 2022-12-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/11/24/4124
_version_ 1797460085612478464
author Filip Bacinger
Ivica Boticki
Danijel Mlinaric
author_facet Filip Bacinger
Ivica Boticki
Danijel Mlinaric
author_sort Filip Bacinger
collection DOAJ
description This paper presents the design and implementation of a system for semi-automating the literature review process based on machine learning. By using machine learning algorithms, the system determines whether scientific papers belong to the topic that is being explored as part of the review process. The system’s user interface allows the process of creating a literature review to be managed through a series of steps: selecting data sources, building queries and topic searches, displaying the scientific papers found, selecting papers that belong to the set of desired papers, running machine learning algorithms for learning and automated classification, and displaying and exporting the final set of papers. Manual literature reviews are compared with automated reviews, and similarities and differences between the two approaches in terms of duration, accuracy, and ease of use are discussed. This study concludes that the best results in terms of sensitivity and accuracy for the automated literature review process are achieved by using a combined machine learning model, which uses multiple unweighted machine learning models. Cross-testing the models on two alternative datasets revealed an overlap in the machine learning hyperparameters. The stable sensitivity and accuracy in the tests indicate the potential for generalized use in automated literature review.
first_indexed 2024-03-09T17:01:06Z
format Article
id doaj.art-f6ae544b2c01475db49d67c2a7c6c4f9
institution Directory Open Access Journal
issn 2079-9292
language English
last_indexed 2024-03-09T17:01:06Z
publishDate 2022-12-01
publisher MDPI AG
record_format Article
series Electronics
spelling doaj.art-f6ae544b2c01475db49d67c2a7c6c4f92023-11-24T14:30:41ZengMDPI AGElectronics2079-92922022-12-011124412410.3390/electronics11244124System for Semi-Automated Literature Review Based on Machine LearningFilip Bacinger0Ivica Boticki1Danijel Mlinaric2Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10 000 Zagreb, CroatiaFaculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10 000 Zagreb, CroatiaFaculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10 000 Zagreb, CroatiaThis paper presents the design and implementation of a system for semi-automating the literature review process based on machine learning. By using machine learning algorithms, the system determines whether scientific papers belong to the topic that is being explored as part of the review process. The system’s user interface allows the process of creating a literature review to be managed through a series of steps: selecting data sources, building queries and topic searches, displaying the scientific papers found, selecting papers that belong to the set of desired papers, running machine learning algorithms for learning and automated classification, and displaying and exporting the final set of papers. Manual literature reviews are compared with automated reviews, and similarities and differences between the two approaches in terms of duration, accuracy, and ease of use are discussed. This study concludes that the best results in terms of sensitivity and accuracy for the automated literature review process are achieved by using a combined machine learning model, which uses multiple unweighted machine learning models. Cross-testing the models on two alternative datasets revealed an overlap in the machine learning hyperparameters. The stable sensitivity and accuracy in the tests indicate the potential for generalized use in automated literature review.https://www.mdpi.com/2079-9292/11/24/4124automating literature reviewmachine learningnatural language processing
spellingShingle Filip Bacinger
Ivica Boticki
Danijel Mlinaric
System for Semi-Automated Literature Review Based on Machine Learning
Electronics
automating literature review
machine learning
natural language processing
title System for Semi-Automated Literature Review Based on Machine Learning
title_full System for Semi-Automated Literature Review Based on Machine Learning
title_fullStr System for Semi-Automated Literature Review Based on Machine Learning
title_full_unstemmed System for Semi-Automated Literature Review Based on Machine Learning
title_short System for Semi-Automated Literature Review Based on Machine Learning
title_sort system for semi automated literature review based on machine learning
topic automating literature review
machine learning
natural language processing
url https://www.mdpi.com/2079-9292/11/24/4124
work_keys_str_mv AT filipbacinger systemforsemiautomatedliteraturereviewbasedonmachinelearning
AT ivicaboticki systemforsemiautomatedliteraturereviewbasedonmachinelearning
AT danijelmlinaric systemforsemiautomatedliteraturereviewbasedonmachinelearning