A Machine Learning Approach for the NLP-Based Analysis of Cyber Threats and Vulnerabilities of the Healthcare Ecosystem
Digitization in healthcare systems, with the wid adoption of Electronic Health Records, connected medical devices, software and systems providing efficient healthcare service delivery and management. On the other hand, the use of these systems has significantly increased cyber threats in the healthc...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-01-01
|
Series: | Sensors |
Subjects: | |
Online Access: | https://www.mdpi.com/1424-8220/23/2/651 |
_version_ | 1797437374202904576 |
---|---|
author | Stefano Silvestri Shareeful Islam Spyridon Papastergiou Christos Tzagkarakis Mario Ciampi |
author_facet | Stefano Silvestri Shareeful Islam Spyridon Papastergiou Christos Tzagkarakis Mario Ciampi |
author_sort | Stefano Silvestri |
collection | DOAJ |
description | Digitization in healthcare systems, with the wid adoption of Electronic Health Records, connected medical devices, software and systems providing efficient healthcare service delivery and management. On the other hand, the use of these systems has significantly increased cyber threats in the healthcare sector. Vulnerabilities in the existing and legacy systems are one of the key causes for the threats and related risks. Understanding and addressing the threats from the connected medical devices and other parts of the ICT health infrastructure are of paramount importance for ensuring security within the overall healthcare ecosystem. Threat and vulnerability analysis provides an effective way to lower the impact of risks relating to the existing vulnerabilities. However, this is a challenging task due to the availability of massive data which makes it difficult to identify potential patterns of security issues. This paper contributes towards an effective threats and vulnerabilities analysis by adopting Machine Learning models, such as the BERT neural language model and XGBoost, to extract updated information from the Natural Language documents largely available on the web, evaluating at the same time the level of the identified threats and vulnerabilities that can impact on the healthcare system, providing the required information for the most appropriate management of the risk. Experiments were performed based on CS news extracted from the Hacker News website and on Common Vulnerabilities and Exposures (CVE) vulnerability reports. The results demonstrate the effectiveness of the proposed approach, which provides a realistic manner to assess the threats and vulnerabilities from Natural Language texts, allowing adopting it in real-world Healthcare ecosystems. |
first_indexed | 2024-03-09T11:18:21Z |
format | Article |
id | doaj.art-85b480ade300442d82ca1adc543913ed |
institution | Directory Open Access Journal |
issn | 1424-8220 |
language | English |
last_indexed | 2024-03-09T11:18:21Z |
publishDate | 2023-01-01 |
publisher | MDPI AG |
record_format | Article |
series | Sensors |
spelling | doaj.art-85b480ade300442d82ca1adc543913ed2023-12-01T00:25:14ZengMDPI AGSensors1424-82202023-01-0123265110.3390/s23020651A Machine Learning Approach for the NLP-Based Analysis of Cyber Threats and Vulnerabilities of the Healthcare EcosystemStefano Silvestri0Shareeful Islam1Spyridon Papastergiou2Christos Tzagkarakis3Mario Ciampi4Institute for High Performance Computing and Networking, National Research Council of Italy (ICAR-CNR), Via Pietro Castellino 111, 80131 Naples, ItalySchool of Computing and Information Science, Anglia Ruskin University, Cambridge CB1 1PT, UKFocal Point, 1410 Waterloo, BelgiumFocal Point, 1410 Waterloo, BelgiumInstitute for High Performance Computing and Networking, National Research Council of Italy (ICAR-CNR), Via Pietro Castellino 111, 80131 Naples, ItalyDigitization in healthcare systems, with the wid adoption of Electronic Health Records, connected medical devices, software and systems providing efficient healthcare service delivery and management. On the other hand, the use of these systems has significantly increased cyber threats in the healthcare sector. Vulnerabilities in the existing and legacy systems are one of the key causes for the threats and related risks. Understanding and addressing the threats from the connected medical devices and other parts of the ICT health infrastructure are of paramount importance for ensuring security within the overall healthcare ecosystem. Threat and vulnerability analysis provides an effective way to lower the impact of risks relating to the existing vulnerabilities. However, this is a challenging task due to the availability of massive data which makes it difficult to identify potential patterns of security issues. This paper contributes towards an effective threats and vulnerabilities analysis by adopting Machine Learning models, such as the BERT neural language model and XGBoost, to extract updated information from the Natural Language documents largely available on the web, evaluating at the same time the level of the identified threats and vulnerabilities that can impact on the healthcare system, providing the required information for the most appropriate management of the risk. Experiments were performed based on CS news extracted from the Hacker News website and on Common Vulnerabilities and Exposures (CVE) vulnerability reports. The results demonstrate the effectiveness of the proposed approach, which provides a realistic manner to assess the threats and vulnerabilities from Natural Language texts, allowing adopting it in real-world Healthcare ecosystems.https://www.mdpi.com/1424-8220/23/2/651healthcare ecosystemcyber threatscyber vulnerabilitieshealthcare information infrastructurenatural language processingmachine learning |
spellingShingle | Stefano Silvestri Shareeful Islam Spyridon Papastergiou Christos Tzagkarakis Mario Ciampi A Machine Learning Approach for the NLP-Based Analysis of Cyber Threats and Vulnerabilities of the Healthcare Ecosystem Sensors healthcare ecosystem cyber threats cyber vulnerabilities healthcare information infrastructure natural language processing machine learning |
title | A Machine Learning Approach for the NLP-Based Analysis of Cyber Threats and Vulnerabilities of the Healthcare Ecosystem |
title_full | A Machine Learning Approach for the NLP-Based Analysis of Cyber Threats and Vulnerabilities of the Healthcare Ecosystem |
title_fullStr | A Machine Learning Approach for the NLP-Based Analysis of Cyber Threats and Vulnerabilities of the Healthcare Ecosystem |
title_full_unstemmed | A Machine Learning Approach for the NLP-Based Analysis of Cyber Threats and Vulnerabilities of the Healthcare Ecosystem |
title_short | A Machine Learning Approach for the NLP-Based Analysis of Cyber Threats and Vulnerabilities of the Healthcare Ecosystem |
title_sort | machine learning approach for the nlp based analysis of cyber threats and vulnerabilities of the healthcare ecosystem |
topic | healthcare ecosystem cyber threats cyber vulnerabilities healthcare information infrastructure natural language processing machine learning |
url | https://www.mdpi.com/1424-8220/23/2/651 |
work_keys_str_mv | AT stefanosilvestri amachinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem AT shareefulislam amachinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem AT spyridonpapastergiou amachinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem AT christostzagkarakis amachinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem AT mariociampi amachinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem AT stefanosilvestri machinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem AT shareefulislam machinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem AT spyridonpapastergiou machinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem AT christostzagkarakis machinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem AT mariociampi machinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem |