Windows PE Malware Detection Using Ensemble Learning

In this Internet age, there are increasingly many threats to the security and safety of users daily. One of such threats is malicious software otherwise known as malware (ransomware, Trojans, viruses, etc.). The effect of this threat can lead to loss or malicious replacement of important information...

Full description

Bibliographic Details
Main Authors: Nureni Ayofe Azeez, Oluwanifise Ebunoluwa Odufuwa, Sanjay Misra, Jonathan Oluranti, Robertas Damaševičius
Format: Article
Language:English
Published: MDPI AG 2021-02-01
Series:Informatics
Subjects:
Online Access:https://www.mdpi.com/2227-9709/8/1/10
_version_ 1827601031374045184
author Nureni Ayofe Azeez
Oluwanifise Ebunoluwa Odufuwa
Sanjay Misra
Jonathan Oluranti
Robertas Damaševičius
author_facet Nureni Ayofe Azeez
Oluwanifise Ebunoluwa Odufuwa
Sanjay Misra
Jonathan Oluranti
Robertas Damaševičius
author_sort Nureni Ayofe Azeez
collection DOAJ
description In this Internet age, there are increasingly many threats to the security and safety of users daily. One of such threats is malicious software otherwise known as malware (ransomware, Trojans, viruses, etc.). The effect of this threat can lead to loss or malicious replacement of important information (such as bank account details, etc.). Malware creators have been able to bypass traditional methods of malware detection, which can be time-consuming and unreliable for unknown malware. This motivates the need for intelligent ways to detect malware, especially new malware which have not been evaluated or studied before. Machine learning provides an intelligent way to detect malware and comprises two stages: feature extraction and classification. This study suggests an ensemble learning-based method for malware detection. The base stage classification is done by a stacked ensemble of fully-connected and one-dimensional convolutional neural networks (CNNs), whereas the end-stage classification is done by a machine learning algorithm. For a meta-learner, we analyzed and compared 15 machine learning classifiers. For comparison, five machine learning algorithms were used: naïve Bayes, decision tree, random forest, gradient boosting, and AdaBoosting. The results of experiments made on the Windows Portable Executable (PE) malware dataset are presented. The best results were obtained by an ensemble of seven neural networks and the ExtraTrees classifier as a final-stage classifier.
first_indexed 2024-03-09T04:47:27Z
format Article
id doaj.art-f5714d339c9240308f942ac46f59d0e0
institution Directory Open Access Journal
issn 2227-9709
language English
last_indexed 2024-03-09T04:47:27Z
publishDate 2021-02-01
publisher MDPI AG
record_format Article
series Informatics
spelling doaj.art-f5714d339c9240308f942ac46f59d0e02023-12-03T13:14:45ZengMDPI AGInformatics2227-97092021-02-01811010.3390/informatics8010010Windows PE Malware Detection Using Ensemble LearningNureni Ayofe Azeez0Oluwanifise Ebunoluwa Odufuwa1Sanjay Misra2Jonathan Oluranti3Robertas Damaševičius4Department of Computer Sciences, Faculty of Science, University of Lagos, Lagos 100001, NigeriaDepartment of Computer Sciences, Faculty of Science, University of Lagos, Lagos 100001, NigeriaCenter of ICT/ICE Research, CUCRID, Covenant University, Ota 112212, NigeriaCenter of ICT/ICE Research, CUCRID, Covenant University, Ota 112212, NigeriaDepartment of Applied Informatics, Vytautas Magnus University, 44404 Kaunas, LithuaniaIn this Internet age, there are increasingly many threats to the security and safety of users daily. One of such threats is malicious software otherwise known as malware (ransomware, Trojans, viruses, etc.). The effect of this threat can lead to loss or malicious replacement of important information (such as bank account details, etc.). Malware creators have been able to bypass traditional methods of malware detection, which can be time-consuming and unreliable for unknown malware. This motivates the need for intelligent ways to detect malware, especially new malware which have not been evaluated or studied before. Machine learning provides an intelligent way to detect malware and comprises two stages: feature extraction and classification. This study suggests an ensemble learning-based method for malware detection. The base stage classification is done by a stacked ensemble of fully-connected and one-dimensional convolutional neural networks (CNNs), whereas the end-stage classification is done by a machine learning algorithm. For a meta-learner, we analyzed and compared 15 machine learning classifiers. For comparison, five machine learning algorithms were used: naïve Bayes, decision tree, random forest, gradient boosting, and AdaBoosting. The results of experiments made on the Windows Portable Executable (PE) malware dataset are presented. The best results were obtained by an ensemble of seven neural networks and the ExtraTrees classifier as a final-stage classifier.https://www.mdpi.com/2227-9709/8/1/10malware detectiondeep learningensemble learningstacking
spellingShingle Nureni Ayofe Azeez
Oluwanifise Ebunoluwa Odufuwa
Sanjay Misra
Jonathan Oluranti
Robertas Damaševičius
Windows PE Malware Detection Using Ensemble Learning
Informatics
malware detection
deep learning
ensemble learning
stacking
title Windows PE Malware Detection Using Ensemble Learning
title_full Windows PE Malware Detection Using Ensemble Learning
title_fullStr Windows PE Malware Detection Using Ensemble Learning
title_full_unstemmed Windows PE Malware Detection Using Ensemble Learning
title_short Windows PE Malware Detection Using Ensemble Learning
title_sort windows pe malware detection using ensemble learning
topic malware detection
deep learning
ensemble learning
stacking
url https://www.mdpi.com/2227-9709/8/1/10
work_keys_str_mv AT nureniayofeazeez windowspemalwaredetectionusingensemblelearning
AT oluwanifiseebunoluwaodufuwa windowspemalwaredetectionusingensemblelearning
AT sanjaymisra windowspemalwaredetectionusingensemblelearning
AT jonathanoluranti windowspemalwaredetectionusingensemblelearning
AT robertasdamasevicius windowspemalwaredetectionusingensemblelearning