Deep Learning Optimisation of Static Malware Detection with Grid Search and Covering Arrays

This paper investigates the impact of several hyperparameters on static malware detection using deep learning, including the number of epochs, batch size, number of layers and neurons, optimisation method, dropout rate, type of activation function, and learning rate. We employed the cAgen tool and g...

Full description

Bibliographic Details
Main Authors: Fahad T. ALGorain, Abdulrahman S. Alnaeem
Format: Article
Language:English
Published: MDPI AG 2023-05-01
Series:Telecom
Subjects:
Online Access:https://www.mdpi.com/2673-4001/4/2/15
Description
Summary:This paper investigates the impact of several hyperparameters on static malware detection using deep learning, including the number of epochs, batch size, number of layers and neurons, optimisation method, dropout rate, type of activation function, and learning rate. We employed the cAgen tool and grid search optimisation from the scikit-learn Python library to identify the best hyperparameters for our Keras deep learning model. Our experiments reveal that cAgen is more efficient than grid search in finding the optimal parameters, and we find that the selection of hyperparameter values has a significant impact on the model’s accuracy. Specifically, our approach leads to significant improvements in the neural network model’s accuracy for static malware detection on the Ember dataset (from 81.2% to 95.7%) and the Kaggle dataset (from 94% to 98.6%). These results demonstrate the effectiveness of our proposed approach, and have important implications for the field of static malware detection.
ISSN:2673-4001