Developing Cross-Domain Host-Based Intrusion Detection

Digital transformation has continued to have a remarkable impact on industries, creating new possibilities and improving the performance of existing ones. Recently, we have seen more deployments of cyber-physical systems and the Internet of Things (IoT) as in no other time. However, cybersecurity is...

Full description

Bibliographic Details
Main Authors:	Oluwagbemiga Ajayi, Aryya Gangopadhyay, Robert F. Erbacher, Carl Bursat
Format:	Article
Language:	English
Published:	MDPI AG 2022-11-01
Series:	Electronics
Subjects:	deep learning cybersecurity HIDS transfer learning word embedding similarity measure
Online Access:	https://www.mdpi.com/2079-9292/11/21/3631

_version_	1797468509363503104
author	Oluwagbemiga Ajayi Aryya Gangopadhyay Robert F. Erbacher Carl Bursat
author_facet	Oluwagbemiga Ajayi Aryya Gangopadhyay Robert F. Erbacher Carl Bursat
author_sort	Oluwagbemiga Ajayi
collection	DOAJ
description	Digital transformation has continued to have a remarkable impact on industries, creating new possibilities and improving the performance of existing ones. Recently, we have seen more deployments of cyber-physical systems and the Internet of Things (IoT) as in no other time. However, cybersecurity is often an afterthought in the design and implementation of many systems; therefore, there usually is an introduction of new attack surfaces as new systems and applications are being deployed. Machine learning has been helpful in creating intrusion detection models, but it is impractical to create attack detection models with acceptable performance for every single computing infrastructure and the various attack scenarios due to the cost of collecting quality labeled data and training models. Hence, there is a need to develop models that can take advantage of knowledge available in a high resource source domain to improve performance of a low resource target domain model. In this work, we propose a novel cross-domain deep learning-based approach for attack detection in Host-based Intrusion Detection Systems (HIDS). Specifically, we developed a method for candidate source domain selection from among a group of potential source domains by computing the similarity score a target domain records when paired with a potential source domain. Then, using different word embedding space combination techniques and transfer learning approach, we leverage the knowledge from a well performing source domain model to improve the performance of a similar model in the target domain. To evaluate our proposed approach, we used Leipzig Intrusion Detection Dataset (LID-DS), a HIDS dataset recorded on a modern operating system that consists of different attack scenarios. Our proposed cross-domain approach recorded significant improvement in the target domains when compared with the results from in-domain approach experiments. Based on the result, the F2-score of the target domain CWE-307 improved from 80% in the in-domain approach to 87% in the cross-domain approach while the target domain CVE-2014-0160 improved from 13% to 85%.
first_indexed	2024-03-09T19:07:29Z
format	Article
id	doaj.art-7b7fa805b0434ddea9e64b8bcdcb0561
institution	Directory Open Access Journal
issn	2079-9292
language	English
last_indexed	2024-03-09T19:07:29Z
publishDate	2022-11-01
publisher	MDPI AG
record_format	Article
series	Electronics
spelling	doaj.art-7b7fa805b0434ddea9e64b8bcdcb05612023-11-24T04:27:02ZengMDPI AGElectronics2079-92922022-11-011121363110.3390/electronics11213631Developing Cross-Domain Host-Based Intrusion DetectionOluwagbemiga Ajayi0Aryya Gangopadhyay1Robert F. Erbacher2Carl Bursat3Department of Information Systems, University of Maryland Baltimore County, Baltimore, MD 21250, USADepartment of Information Systems, University of Maryland Baltimore County, Baltimore, MD 21250, USADEVCOM Army Research Laboratory, Adelphi, MD 20783, USADEVCOM Army Research Laboratory, Adelphi, MD 20783, USADigital transformation has continued to have a remarkable impact on industries, creating new possibilities and improving the performance of existing ones. Recently, we have seen more deployments of cyber-physical systems and the Internet of Things (IoT) as in no other time. However, cybersecurity is often an afterthought in the design and implementation of many systems; therefore, there usually is an introduction of new attack surfaces as new systems and applications are being deployed. Machine learning has been helpful in creating intrusion detection models, but it is impractical to create attack detection models with acceptable performance for every single computing infrastructure and the various attack scenarios due to the cost of collecting quality labeled data and training models. Hence, there is a need to develop models that can take advantage of knowledge available in a high resource source domain to improve performance of a low resource target domain model. In this work, we propose a novel cross-domain deep learning-based approach for attack detection in Host-based Intrusion Detection Systems (HIDS). Specifically, we developed a method for candidate source domain selection from among a group of potential source domains by computing the similarity score a target domain records when paired with a potential source domain. Then, using different word embedding space combination techniques and transfer learning approach, we leverage the knowledge from a well performing source domain model to improve the performance of a similar model in the target domain. To evaluate our proposed approach, we used Leipzig Intrusion Detection Dataset (LID-DS), a HIDS dataset recorded on a modern operating system that consists of different attack scenarios. Our proposed cross-domain approach recorded significant improvement in the target domains when compared with the results from in-domain approach experiments. Based on the result, the F2-score of the target domain CWE-307 improved from 80% in the in-domain approach to 87% in the cross-domain approach while the target domain CVE-2014-0160 improved from 13% to 85%.https://www.mdpi.com/2079-9292/11/21/3631deep learningcybersecurityHIDStransfer learningword embeddingsimilarity measure
spellingShingle	Oluwagbemiga Ajayi Aryya Gangopadhyay Robert F. Erbacher Carl Bursat Developing Cross-Domain Host-Based Intrusion Detection Electronics deep learning cybersecurity HIDS transfer learning word embedding similarity measure
title	Developing Cross-Domain Host-Based Intrusion Detection
title_full	Developing Cross-Domain Host-Based Intrusion Detection
title_fullStr	Developing Cross-Domain Host-Based Intrusion Detection
title_full_unstemmed	Developing Cross-Domain Host-Based Intrusion Detection
title_short	Developing Cross-Domain Host-Based Intrusion Detection
title_sort	developing cross domain host based intrusion detection
topic	deep learning cybersecurity HIDS transfer learning word embedding similarity measure
url	https://www.mdpi.com/2079-9292/11/21/3631
work_keys_str_mv	AT oluwagbemigaajayi developingcrossdomainhostbasedintrusiondetection AT aryyagangopadhyay developingcrossdomainhostbasedintrusiondetection AT robertferbacher developingcrossdomainhostbasedintrusiondetection AT carlbursat developingcrossdomainhostbasedintrusiondetection

Developing Cross-Domain Host-Based Intrusion Detection

Similar Items