Evaluation of feature selection algorithm for android malware detection

This paper synthesizes an evaluation of feature selection algorithm by utilizing Term Frequency Inverse Document Frequency (TF-IDF) as the main algorithm in Android malware detection. The Android features were filtered before detection process using TF-IDF algorithm. However, IDF is unaware to the t...

Full description

Bibliographic Details
Main Authors: Mazlan, Nurul Hidayah, A Hamid, Isredza Rahmi
Format: Article
Published: Science Publishing Corporation 2018
Subjects:
_version_ 1796869126581387264
author Mazlan, Nurul Hidayah
A Hamid, Isredza Rahmi
author_facet Mazlan, Nurul Hidayah
A Hamid, Isredza Rahmi
author_sort Mazlan, Nurul Hidayah
collection UTHM
description This paper synthesizes an evaluation of feature selection algorithm by utilizing Term Frequency Inverse Document Frequency (TF-IDF) as the main algorithm in Android malware detection. The Android features were filtered before detection process using TF-IDF algorithm. However, IDF is unaware to the training class labels and give incorrect weight value to some features. Therefore, the proposed approach modified the TF-IDF algorithm, where the algorithm focused on both sample and feature. Proposed algorithm applied considers the feature based on its level of importance. The related best features in the sample are selected using weight and priority ranking process. This increases the effect of important malware features selected in the Android application sample. These experiments are conducted on a sample collected from DREBIN dataset. The comparison between existing TF-IDF algorithm and modified TF-IDF (MTF-IDF) algorithm have been tested in various conditions such as different number of sample, different number of feature and combination of different types of feature. The analysis results show feature selection using MTF-IDF can improve malware detection analysis. MTF-IDF proved either using various kinds of feature or various kinds of dataset size, algorithm still effective for Android malware detection. MTF-IDF algorithm also proved that it could give appropriate scaling for all features in analyzing Android malware detection.
first_indexed 2024-03-05T21:50:10Z
format Article
id uthm.eprints-5019
institution Universiti Tun Hussein Onn Malaysia
last_indexed 2024-03-05T21:50:10Z
publishDate 2018
publisher Science Publishing Corporation
record_format dspace
spelling uthm.eprints-50192022-01-03T06:14:49Z http://eprints.uthm.edu.my/5019/ Evaluation of feature selection algorithm for android malware detection Mazlan, Nurul Hidayah A Hamid, Isredza Rahmi TA Engineering (General). Civil engineering (General) TA168 Systems engineering This paper synthesizes an evaluation of feature selection algorithm by utilizing Term Frequency Inverse Document Frequency (TF-IDF) as the main algorithm in Android malware detection. The Android features were filtered before detection process using TF-IDF algorithm. However, IDF is unaware to the training class labels and give incorrect weight value to some features. Therefore, the proposed approach modified the TF-IDF algorithm, where the algorithm focused on both sample and feature. Proposed algorithm applied considers the feature based on its level of importance. The related best features in the sample are selected using weight and priority ranking process. This increases the effect of important malware features selected in the Android application sample. These experiments are conducted on a sample collected from DREBIN dataset. The comparison between existing TF-IDF algorithm and modified TF-IDF (MTF-IDF) algorithm have been tested in various conditions such as different number of sample, different number of feature and combination of different types of feature. The analysis results show feature selection using MTF-IDF can improve malware detection analysis. MTF-IDF proved either using various kinds of feature or various kinds of dataset size, algorithm still effective for Android malware detection. MTF-IDF algorithm also proved that it could give appropriate scaling for all features in analyzing Android malware detection. Science Publishing Corporation 2018 Article PeerReviewed Mazlan, Nurul Hidayah and A Hamid, Isredza Rahmi (2018) Evaluation of feature selection algorithm for android malware detection. International Journal of Engineering & Technology, 7 (4.31). pp. 311-315. ISSN 2227-524X
spellingShingle TA Engineering (General). Civil engineering (General)
TA168 Systems engineering
Mazlan, Nurul Hidayah
A Hamid, Isredza Rahmi
Evaluation of feature selection algorithm for android malware detection
title Evaluation of feature selection algorithm for android malware detection
title_full Evaluation of feature selection algorithm for android malware detection
title_fullStr Evaluation of feature selection algorithm for android malware detection
title_full_unstemmed Evaluation of feature selection algorithm for android malware detection
title_short Evaluation of feature selection algorithm for android malware detection
title_sort evaluation of feature selection algorithm for android malware detection
topic TA Engineering (General). Civil engineering (General)
TA168 Systems engineering
work_keys_str_mv AT mazlannurulhidayah evaluationoffeatureselectionalgorithmforandroidmalwaredetection
AT ahamidisredzarahmi evaluationoffeatureselectionalgorithmforandroidmalwaredetection