A Machine Learning Approach for the NLP-Based Analysis of Cyber Threats and Vulnerabilities of the Healthcare Ecosystem

Digitization in healthcare systems, with the wid adoption of Electronic Health Records, connected medical devices, software and systems providing efficient healthcare service delivery and management. On the other hand, the use of these systems has significantly increased cyber threats in the healthc...

Full description

Bibliographic Details
Main Authors: Stefano Silvestri, Shareeful Islam, Spyridon Papastergiou, Christos Tzagkarakis, Mario Ciampi
Format: Article
Language:English
Published: MDPI AG 2023-01-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/23/2/651
_version_ 1797437374202904576
author Stefano Silvestri
Shareeful Islam
Spyridon Papastergiou
Christos Tzagkarakis
Mario Ciampi
author_facet Stefano Silvestri
Shareeful Islam
Spyridon Papastergiou
Christos Tzagkarakis
Mario Ciampi
author_sort Stefano Silvestri
collection DOAJ
description Digitization in healthcare systems, with the wid adoption of Electronic Health Records, connected medical devices, software and systems providing efficient healthcare service delivery and management. On the other hand, the use of these systems has significantly increased cyber threats in the healthcare sector. Vulnerabilities in the existing and legacy systems are one of the key causes for the threats and related risks. Understanding and addressing the threats from the connected medical devices and other parts of the ICT health infrastructure are of paramount importance for ensuring security within the overall healthcare ecosystem. Threat and vulnerability analysis provides an effective way to lower the impact of risks relating to the existing vulnerabilities. However, this is a challenging task due to the availability of massive data which makes it difficult to identify potential patterns of security issues. This paper contributes towards an effective threats and vulnerabilities analysis by adopting Machine Learning models, such as the BERT neural language model and XGBoost, to extract updated information from the Natural Language documents largely available on the web, evaluating at the same time the level of the identified threats and vulnerabilities that can impact on the healthcare system, providing the required information for the most appropriate management of the risk. Experiments were performed based on CS news extracted from the Hacker News website and on Common Vulnerabilities and Exposures (CVE) vulnerability reports. The results demonstrate the effectiveness of the proposed approach, which provides a realistic manner to assess the threats and vulnerabilities from Natural Language texts, allowing adopting it in real-world Healthcare ecosystems.
first_indexed 2024-03-09T11:18:21Z
format Article
id doaj.art-85b480ade300442d82ca1adc543913ed
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-09T11:18:21Z
publishDate 2023-01-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-85b480ade300442d82ca1adc543913ed2023-12-01T00:25:14ZengMDPI AGSensors1424-82202023-01-0123265110.3390/s23020651A Machine Learning Approach for the NLP-Based Analysis of Cyber Threats and Vulnerabilities of the Healthcare EcosystemStefano Silvestri0Shareeful Islam1Spyridon Papastergiou2Christos Tzagkarakis3Mario Ciampi4Institute for High Performance Computing and Networking, National Research Council of Italy (ICAR-CNR), Via Pietro Castellino 111, 80131 Naples, ItalySchool of Computing and Information Science, Anglia Ruskin University, Cambridge CB1 1PT, UKFocal Point, 1410 Waterloo, BelgiumFocal Point, 1410 Waterloo, BelgiumInstitute for High Performance Computing and Networking, National Research Council of Italy (ICAR-CNR), Via Pietro Castellino 111, 80131 Naples, ItalyDigitization in healthcare systems, with the wid adoption of Electronic Health Records, connected medical devices, software and systems providing efficient healthcare service delivery and management. On the other hand, the use of these systems has significantly increased cyber threats in the healthcare sector. Vulnerabilities in the existing and legacy systems are one of the key causes for the threats and related risks. Understanding and addressing the threats from the connected medical devices and other parts of the ICT health infrastructure are of paramount importance for ensuring security within the overall healthcare ecosystem. Threat and vulnerability analysis provides an effective way to lower the impact of risks relating to the existing vulnerabilities. However, this is a challenging task due to the availability of massive data which makes it difficult to identify potential patterns of security issues. This paper contributes towards an effective threats and vulnerabilities analysis by adopting Machine Learning models, such as the BERT neural language model and XGBoost, to extract updated information from the Natural Language documents largely available on the web, evaluating at the same time the level of the identified threats and vulnerabilities that can impact on the healthcare system, providing the required information for the most appropriate management of the risk. Experiments were performed based on CS news extracted from the Hacker News website and on Common Vulnerabilities and Exposures (CVE) vulnerability reports. The results demonstrate the effectiveness of the proposed approach, which provides a realistic manner to assess the threats and vulnerabilities from Natural Language texts, allowing adopting it in real-world Healthcare ecosystems.https://www.mdpi.com/1424-8220/23/2/651healthcare ecosystemcyber threatscyber vulnerabilitieshealthcare information infrastructurenatural language processingmachine learning
spellingShingle Stefano Silvestri
Shareeful Islam
Spyridon Papastergiou
Christos Tzagkarakis
Mario Ciampi
A Machine Learning Approach for the NLP-Based Analysis of Cyber Threats and Vulnerabilities of the Healthcare Ecosystem
Sensors
healthcare ecosystem
cyber threats
cyber vulnerabilities
healthcare information infrastructure
natural language processing
machine learning
title A Machine Learning Approach for the NLP-Based Analysis of Cyber Threats and Vulnerabilities of the Healthcare Ecosystem
title_full A Machine Learning Approach for the NLP-Based Analysis of Cyber Threats and Vulnerabilities of the Healthcare Ecosystem
title_fullStr A Machine Learning Approach for the NLP-Based Analysis of Cyber Threats and Vulnerabilities of the Healthcare Ecosystem
title_full_unstemmed A Machine Learning Approach for the NLP-Based Analysis of Cyber Threats and Vulnerabilities of the Healthcare Ecosystem
title_short A Machine Learning Approach for the NLP-Based Analysis of Cyber Threats and Vulnerabilities of the Healthcare Ecosystem
title_sort machine learning approach for the nlp based analysis of cyber threats and vulnerabilities of the healthcare ecosystem
topic healthcare ecosystem
cyber threats
cyber vulnerabilities
healthcare information infrastructure
natural language processing
machine learning
url https://www.mdpi.com/1424-8220/23/2/651
work_keys_str_mv AT stefanosilvestri amachinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem
AT shareefulislam amachinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem
AT spyridonpapastergiou amachinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem
AT christostzagkarakis amachinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem
AT mariociampi amachinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem
AT stefanosilvestri machinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem
AT shareefulislam machinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem
AT spyridonpapastergiou machinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem
AT christostzagkarakis machinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem
AT mariociampi machinelearningapproachforthenlpbasedanalysisofcyberthreatsandvulnerabilitiesofthehealthcareecosystem