A Decision-Fusion-Based Ensemble Approach for Malicious Websites Detection

Malicious websites detection is one of the cyber-security tasks that protects sensitive information such as credit card details and login credentials from attackers. Machine learning (ML)-based methods have been commonly used in several applications of cyber-security research. Although there are som...

Full description

Bibliographic Details
Main Authors: Abed Alanazi, Abdu Gumaei
Format: Article
Language:English
Published: MDPI AG 2023-09-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/13/18/10260
_version_ 1827727407867494400
author Abed Alanazi
Abdu Gumaei
author_facet Abed Alanazi
Abdu Gumaei
author_sort Abed Alanazi
collection DOAJ
description Malicious websites detection is one of the cyber-security tasks that protects sensitive information such as credit card details and login credentials from attackers. Machine learning (ML)-based methods have been commonly used in several applications of cyber-security research. Although there are some methods and approaches proposed in the state-of-the-art studies, the advancement of the most effective solution is still of research interest and needs to be improved. Recently, decision fusion methods play an important role in improving the accuracy of ML methods. They are broadly classified based on the type of fusion into a voting decision fusion technique and a divide and conquer decision fusion technique. In this paper, a decision fusion ensemble learning (DFEL) model is proposed based on voting technique for detecting malicious websites. It combines the predictions of three effective ensemble classifiers, namely, gradient boosting (GB) classifier, extreme gradient boosting (XGB) classifier, and random forest (RF) classifier. We use these classifiers because their advantages to perform well for class imbalanced and data with statistical noises such as in the case of malicious websites detection. A weighted majority-voting rule is utilized for generating the final decisions of used classifiers. The experimental results are conducted on a publicly available large dataset of malicious and benign websites. The comparative study exposed that the DFEL model achieves high accuracies, which are 97.25% on average of 10-fold cross-validation test and 98.50% on a holdout of 30% test set. This confirms the ability of proposed approach to improve the detection rate of malicious websites.
first_indexed 2024-03-10T23:04:45Z
format Article
id doaj.art-f1ae4260186749538371a60cd1ca7f9c
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-10T23:04:45Z
publishDate 2023-09-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-f1ae4260186749538371a60cd1ca7f9c2023-11-19T09:25:15ZengMDPI AGApplied Sciences2076-34172023-09-0113181026010.3390/app131810260A Decision-Fusion-Based Ensemble Approach for Malicious Websites DetectionAbed Alanazi0Abdu Gumaei1Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam Bin Abdulaziz University, Al-Kharj 11942, Saudi ArabiaDepartment of Computer Science, College of Computer Engineering and Sciences, Prince Sattam Bin Abdulaziz University, Al-Kharj 11942, Saudi ArabiaMalicious websites detection is one of the cyber-security tasks that protects sensitive information such as credit card details and login credentials from attackers. Machine learning (ML)-based methods have been commonly used in several applications of cyber-security research. Although there are some methods and approaches proposed in the state-of-the-art studies, the advancement of the most effective solution is still of research interest and needs to be improved. Recently, decision fusion methods play an important role in improving the accuracy of ML methods. They are broadly classified based on the type of fusion into a voting decision fusion technique and a divide and conquer decision fusion technique. In this paper, a decision fusion ensemble learning (DFEL) model is proposed based on voting technique for detecting malicious websites. It combines the predictions of three effective ensemble classifiers, namely, gradient boosting (GB) classifier, extreme gradient boosting (XGB) classifier, and random forest (RF) classifier. We use these classifiers because their advantages to perform well for class imbalanced and data with statistical noises such as in the case of malicious websites detection. A weighted majority-voting rule is utilized for generating the final decisions of used classifiers. The experimental results are conducted on a publicly available large dataset of malicious and benign websites. The comparative study exposed that the DFEL model achieves high accuracies, which are 97.25% on average of 10-fold cross-validation test and 98.50% on a holdout of 30% test set. This confirms the ability of proposed approach to improve the detection rate of malicious websites.https://www.mdpi.com/2076-3417/13/18/10260cyber-securitymalicious websitesbenign websitesURLsensemble learningdecision fusion ensemble learning (DFEL) model
spellingShingle Abed Alanazi
Abdu Gumaei
A Decision-Fusion-Based Ensemble Approach for Malicious Websites Detection
Applied Sciences
cyber-security
malicious websites
benign websites
URLs
ensemble learning
decision fusion ensemble learning (DFEL) model
title A Decision-Fusion-Based Ensemble Approach for Malicious Websites Detection
title_full A Decision-Fusion-Based Ensemble Approach for Malicious Websites Detection
title_fullStr A Decision-Fusion-Based Ensemble Approach for Malicious Websites Detection
title_full_unstemmed A Decision-Fusion-Based Ensemble Approach for Malicious Websites Detection
title_short A Decision-Fusion-Based Ensemble Approach for Malicious Websites Detection
title_sort decision fusion based ensemble approach for malicious websites detection
topic cyber-security
malicious websites
benign websites
URLs
ensemble learning
decision fusion ensemble learning (DFEL) model
url https://www.mdpi.com/2076-3417/13/18/10260
work_keys_str_mv AT abedalanazi adecisionfusionbasedensembleapproachformaliciouswebsitesdetection
AT abdugumaei adecisionfusionbasedensembleapproachformaliciouswebsitesdetection
AT abedalanazi decisionfusionbasedensembleapproachformaliciouswebsitesdetection
AT abdugumaei decisionfusionbasedensembleapproachformaliciouswebsitesdetection