Federated Learning for Intrusion Detection in the Critical Infrastructures: Vertically Partitioned Data Use Case

One of the challenges in the Internet of Things systems is the security of the critical data, for example, data used for intrusion detection. The paper research construction of an intrusion detection system that ensures the confidentiality of critical data at a given level of intrusion detection acc...

Full description

Bibliographic Details
Main Authors:	Evgenia Novikova, Elena Doynikova, Sergey Golubev
Format:	Article
Language:	English
Published:	MDPI AG 2022-03-01
Series:	Algorithms
Subjects:	intrusion detection critical infrastructures confidential data federated learning vertically partitioned data gradient boosting decision trees
Online Access:	https://www.mdpi.com/1999-4893/15/4/104

_version_	1797437221651873792
author	Evgenia Novikova Elena Doynikova Sergey Golubev
author_facet	Evgenia Novikova Elena Doynikova Sergey Golubev
author_sort	Evgenia Novikova
collection	DOAJ
description	One of the challenges in the Internet of Things systems is the security of the critical data, for example, data used for intrusion detection. The paper research construction of an intrusion detection system that ensures the confidentiality of critical data at a given level of intrusion detection accuracy. For this goal, federated learning is used to train an intrusion detection model. Federated learning is a computational model for distributed machine learning that allows different collaborating entities to train one global model without sharing data. This paper considers the case when entities have data that are different in attributes. Authors believe that it is a common situation for the critical systems constructed using Internet of Things (IoT) technology, when industrial objects are monitored by different sets of sensors. To evaluate the applicability of the federated learning for this case, the authors developed an approach and an architecture of the intrusion detection system for vertically partitioned data that consider the principles of federated learning and conducted the series of experiments. To model vertically partitioned data, the authors used the Secure Water Treatment (SWaT) data set that describes the functioning of the water treatment facility. The conducted experiments demonstrate that the accuracy of the intrusion detection model trained using federated learning is compared with the accuracy of the intrusion detection model trained using the centralized machine learning model. However, the computational efficiency of the learning and inference process is currently extremely low. It is explained by the application of homomorphic encryption for input data protection from different data owners or data sources. This defines the necessity to elaborate techniques for generating attributes that could model horizontally partitioned data even for the cases when the collaborating entities share datasets that differ in their attributes.
first_indexed	2024-03-09T11:16:44Z
format	Article
id	doaj.art-4196f21aa07044878d09417f690b2793
institution	Directory Open Access Journal
issn	1999-4893
language	English
last_indexed	2024-03-09T11:16:44Z
publishDate	2022-03-01
publisher	MDPI AG
record_format	Article
series	Algorithms
spelling	doaj.art-4196f21aa07044878d09417f690b27932023-12-01T00:28:35ZengMDPI AGAlgorithms1999-48932022-03-0115410410.3390/a15040104Federated Learning for Intrusion Detection in the Critical Infrastructures: Vertically Partitioned Data Use CaseEvgenia Novikova0Elena Doynikova1Sergey Golubev2Department of Computer Science and Engineering, St. Petersburg Electrotechnical University “LETI”, 197022 St. Petersburg, RussiaComputer Security Problems Laboratory, St. Petersburg Federal Research Center of the Russian Academy of Sciences, 199178 Saint-Petersburg, RussiaComputer Security Problems Laboratory, St. Petersburg Federal Research Center of the Russian Academy of Sciences, 199178 Saint-Petersburg, RussiaOne of the challenges in the Internet of Things systems is the security of the critical data, for example, data used for intrusion detection. The paper research construction of an intrusion detection system that ensures the confidentiality of critical data at a given level of intrusion detection accuracy. For this goal, federated learning is used to train an intrusion detection model. Federated learning is a computational model for distributed machine learning that allows different collaborating entities to train one global model without sharing data. This paper considers the case when entities have data that are different in attributes. Authors believe that it is a common situation for the critical systems constructed using Internet of Things (IoT) technology, when industrial objects are monitored by different sets of sensors. To evaluate the applicability of the federated learning for this case, the authors developed an approach and an architecture of the intrusion detection system for vertically partitioned data that consider the principles of federated learning and conducted the series of experiments. To model vertically partitioned data, the authors used the Secure Water Treatment (SWaT) data set that describes the functioning of the water treatment facility. The conducted experiments demonstrate that the accuracy of the intrusion detection model trained using federated learning is compared with the accuracy of the intrusion detection model trained using the centralized machine learning model. However, the computational efficiency of the learning and inference process is currently extremely low. It is explained by the application of homomorphic encryption for input data protection from different data owners or data sources. This defines the necessity to elaborate techniques for generating attributes that could model horizontally partitioned data even for the cases when the collaborating entities share datasets that differ in their attributes.https://www.mdpi.com/1999-4893/15/4/104intrusion detectioncritical infrastructuresconfidential datafederated learningvertically partitioned datagradient boosting decision trees
spellingShingle	Evgenia Novikova Elena Doynikova Sergey Golubev Federated Learning for Intrusion Detection in the Critical Infrastructures: Vertically Partitioned Data Use Case Algorithms intrusion detection critical infrastructures confidential data federated learning vertically partitioned data gradient boosting decision trees
title	Federated Learning for Intrusion Detection in the Critical Infrastructures: Vertically Partitioned Data Use Case
title_full	Federated Learning for Intrusion Detection in the Critical Infrastructures: Vertically Partitioned Data Use Case
title_fullStr	Federated Learning for Intrusion Detection in the Critical Infrastructures: Vertically Partitioned Data Use Case
title_full_unstemmed	Federated Learning for Intrusion Detection in the Critical Infrastructures: Vertically Partitioned Data Use Case
title_short	Federated Learning for Intrusion Detection in the Critical Infrastructures: Vertically Partitioned Data Use Case
title_sort	federated learning for intrusion detection in the critical infrastructures vertically partitioned data use case
topic	intrusion detection critical infrastructures confidential data federated learning vertically partitioned data gradient boosting decision trees
url	https://www.mdpi.com/1999-4893/15/4/104
work_keys_str_mv	AT evgenianovikova federatedlearningforintrusiondetectioninthecriticalinfrastructuresverticallypartitioneddatausecase AT elenadoynikova federatedlearningforintrusiondetectioninthecriticalinfrastructuresverticallypartitioneddatausecase AT sergeygolubev federatedlearningforintrusiondetectioninthecriticalinfrastructuresverticallypartitioneddatausecase

Federated Learning for Intrusion Detection in the Critical Infrastructures: Vertically Partitioned Data Use Case

Similar Items