LongCGDroid: Android malware detection through longitudinal study for machine learning and deep learning

This study aims to compare the longitudinal performance between machine learning and deep learning classifiers for Android malware detection, employing different levels of feature abstraction. Using a dataset of 200k Android apps labeled by date within a 10-year range (2013-2022), we propose the Lon...

Full description

Bibliographic Details
Main Authors: Abdelhak Mesbah, Ibtihel Baddari, Mohamed Amine Raihla
Format: Article
Language:English
Published: Scientific Research Support Fund of Jordan (SRSF) and Princess Sumaya University for Technology (PSUT) 2023-12-01
Series:Jordanian Journal of Computers and Information Technology
Subjects:
Online Access:https://www.jjcit.org/?mno=167554
_version_ 1797441209335021568
author Abdelhak Mesbah
Ibtihel Baddari
Mohamed Amine Raihla
author_facet Abdelhak Mesbah
Ibtihel Baddari
Mohamed Amine Raihla
author_sort Abdelhak Mesbah
collection DOAJ
description This study aims to compare the longitudinal performance between machine learning and deep learning classifiers for Android malware detection, employing different levels of feature abstraction. Using a dataset of 200k Android apps labeled by date within a 10-year range (2013-2022), we propose the LongCGDroid, an image-based effective approach for Android malware detection. We use the semantic Call Graph API representation that is derived from the Control Flow Graph and Data Flow Graph to extract abstracted API calls. Thus, we evaluate the longitudinal performance of LongCGDroid against API changes. Different models are used, machine learning models (LR, RF, KNN, SVM) and deep learning models (CNN, RNN). Empirical experiments demonstrate a progressive decline in performance for all classifiers when evaluated on samples from later periods. Whereas, the deep learning CNN model under the class abstraction maintains a certain stability over time. In comparison with eight state-of-the-art approaches, LongCGDroid achieves higher accuracy. [JJCIT 2023; 9(4.000): 328-346]
first_indexed 2024-03-09T12:20:44Z
format Article
id doaj.art-4dc1c77bbc2a424ab19ecd75cc205e1b
institution Directory Open Access Journal
issn 2413-9351
2415-1076
language English
last_indexed 2024-03-09T12:20:44Z
publishDate 2023-12-01
publisher Scientific Research Support Fund of Jordan (SRSF) and Princess Sumaya University for Technology (PSUT)
record_format Article
series Jordanian Journal of Computers and Information Technology
spelling doaj.art-4dc1c77bbc2a424ab19ecd75cc205e1b2023-11-30T22:41:05ZengScientific Research Support Fund of Jordan (SRSF) and Princess Sumaya University for Technology (PSUT)Jordanian Journal of Computers and Information Technology2413-93512415-10762023-12-019432834610.5455/jjcit.71-1693392249167554LongCGDroid: Android malware detection through longitudinal study for machine learning and deep learningAbdelhak Mesbah0Ibtihel Baddari1Mohamed Amine Raihla2University M'Hamed Bougara of Boumerdes University M'Hamed Bougara of Boumerdes University M'Hamed Bougara of BoumerdesThis study aims to compare the longitudinal performance between machine learning and deep learning classifiers for Android malware detection, employing different levels of feature abstraction. Using a dataset of 200k Android apps labeled by date within a 10-year range (2013-2022), we propose the LongCGDroid, an image-based effective approach for Android malware detection. We use the semantic Call Graph API representation that is derived from the Control Flow Graph and Data Flow Graph to extract abstracted API calls. Thus, we evaluate the longitudinal performance of LongCGDroid against API changes. Different models are used, machine learning models (LR, RF, KNN, SVM) and deep learning models (CNN, RNN). Empirical experiments demonstrate a progressive decline in performance for all classifiers when evaluated on samples from later periods. Whereas, the deep learning CNN model under the class abstraction maintains a certain stability over time. In comparison with eight state-of-the-art approaches, LongCGDroid achieves higher accuracy. [JJCIT 2023; 9(4.000): 328-346]https://www.jjcit.org/?mno=167554android securitymalware detectionmachine learningadjacency matrixlongitudinal evaluation
spellingShingle Abdelhak Mesbah
Ibtihel Baddari
Mohamed Amine Raihla
LongCGDroid: Android malware detection through longitudinal study for machine learning and deep learning
Jordanian Journal of Computers and Information Technology
android security
malware detection
machine learning
adjacency matrix
longitudinal evaluation
title LongCGDroid: Android malware detection through longitudinal study for machine learning and deep learning
title_full LongCGDroid: Android malware detection through longitudinal study for machine learning and deep learning
title_fullStr LongCGDroid: Android malware detection through longitudinal study for machine learning and deep learning
title_full_unstemmed LongCGDroid: Android malware detection through longitudinal study for machine learning and deep learning
title_short LongCGDroid: Android malware detection through longitudinal study for machine learning and deep learning
title_sort longcgdroid android malware detection through longitudinal study for machine learning and deep learning
topic android security
malware detection
machine learning
adjacency matrix
longitudinal evaluation
url https://www.jjcit.org/?mno=167554
work_keys_str_mv AT abdelhakmesbah longcgdroidandroidmalwaredetectionthroughlongitudinalstudyformachinelearninganddeeplearning
AT ibtihelbaddari longcgdroidandroidmalwaredetectionthroughlongitudinalstudyformachinelearninganddeeplearning
AT mohamedamineraihla longcgdroidandroidmalwaredetectionthroughlongitudinalstudyformachinelearninganddeeplearning