System for Semi-Automated Literature Review Based on Machine Learning
This paper presents the design and implementation of a system for semi-automating the literature review process based on machine learning. By using machine learning algorithms, the system determines whether scientific papers belong to the topic that is being explored as part of the review process. T...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2022-12-01
|
Series: | Electronics |
Subjects: | |
Online Access: | https://www.mdpi.com/2079-9292/11/24/4124 |
_version_ | 1797460085612478464 |
---|---|
author | Filip Bacinger Ivica Boticki Danijel Mlinaric |
author_facet | Filip Bacinger Ivica Boticki Danijel Mlinaric |
author_sort | Filip Bacinger |
collection | DOAJ |
description | This paper presents the design and implementation of a system for semi-automating the literature review process based on machine learning. By using machine learning algorithms, the system determines whether scientific papers belong to the topic that is being explored as part of the review process. The system’s user interface allows the process of creating a literature review to be managed through a series of steps: selecting data sources, building queries and topic searches, displaying the scientific papers found, selecting papers that belong to the set of desired papers, running machine learning algorithms for learning and automated classification, and displaying and exporting the final set of papers. Manual literature reviews are compared with automated reviews, and similarities and differences between the two approaches in terms of duration, accuracy, and ease of use are discussed. This study concludes that the best results in terms of sensitivity and accuracy for the automated literature review process are achieved by using a combined machine learning model, which uses multiple unweighted machine learning models. Cross-testing the models on two alternative datasets revealed an overlap in the machine learning hyperparameters. The stable sensitivity and accuracy in the tests indicate the potential for generalized use in automated literature review. |
first_indexed | 2024-03-09T17:01:06Z |
format | Article |
id | doaj.art-f6ae544b2c01475db49d67c2a7c6c4f9 |
institution | Directory Open Access Journal |
issn | 2079-9292 |
language | English |
last_indexed | 2024-03-09T17:01:06Z |
publishDate | 2022-12-01 |
publisher | MDPI AG |
record_format | Article |
series | Electronics |
spelling | doaj.art-f6ae544b2c01475db49d67c2a7c6c4f92023-11-24T14:30:41ZengMDPI AGElectronics2079-92922022-12-011124412410.3390/electronics11244124System for Semi-Automated Literature Review Based on Machine LearningFilip Bacinger0Ivica Boticki1Danijel Mlinaric2Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10 000 Zagreb, CroatiaFaculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10 000 Zagreb, CroatiaFaculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10 000 Zagreb, CroatiaThis paper presents the design and implementation of a system for semi-automating the literature review process based on machine learning. By using machine learning algorithms, the system determines whether scientific papers belong to the topic that is being explored as part of the review process. The system’s user interface allows the process of creating a literature review to be managed through a series of steps: selecting data sources, building queries and topic searches, displaying the scientific papers found, selecting papers that belong to the set of desired papers, running machine learning algorithms for learning and automated classification, and displaying and exporting the final set of papers. Manual literature reviews are compared with automated reviews, and similarities and differences between the two approaches in terms of duration, accuracy, and ease of use are discussed. This study concludes that the best results in terms of sensitivity and accuracy for the automated literature review process are achieved by using a combined machine learning model, which uses multiple unweighted machine learning models. Cross-testing the models on two alternative datasets revealed an overlap in the machine learning hyperparameters. The stable sensitivity and accuracy in the tests indicate the potential for generalized use in automated literature review.https://www.mdpi.com/2079-9292/11/24/4124automating literature reviewmachine learningnatural language processing |
spellingShingle | Filip Bacinger Ivica Boticki Danijel Mlinaric System for Semi-Automated Literature Review Based on Machine Learning Electronics automating literature review machine learning natural language processing |
title | System for Semi-Automated Literature Review Based on Machine Learning |
title_full | System for Semi-Automated Literature Review Based on Machine Learning |
title_fullStr | System for Semi-Automated Literature Review Based on Machine Learning |
title_full_unstemmed | System for Semi-Automated Literature Review Based on Machine Learning |
title_short | System for Semi-Automated Literature Review Based on Machine Learning |
title_sort | system for semi automated literature review based on machine learning |
topic | automating literature review machine learning natural language processing |
url | https://www.mdpi.com/2079-9292/11/24/4124 |
work_keys_str_mv | AT filipbacinger systemforsemiautomatedliteraturereviewbasedonmachinelearning AT ivicaboticki systemforsemiautomatedliteraturereviewbasedonmachinelearning AT danijelmlinaric systemforsemiautomatedliteraturereviewbasedonmachinelearning |