Multi-Type Feature Extraction and Early Fusion Framework for SMS Spam Detection
SMS spam is a pervasive issue that affects millions worldwide, leading to significant inconvenience, time wastage, and potential financial scams. Given the prevalence and potential harm, accurate and real-time detection of SMS spam is crucial. This paper proposes a novel approach to SMS spam detecti...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2023-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10296923/ |
_version_ | 1797632795754889216 |
---|---|
author | Hussein Alaa Al-Kabbi Mohammad-Reza Feizi-Derakhshi Saeid Pashazadeh |
author_facet | Hussein Alaa Al-Kabbi Mohammad-Reza Feizi-Derakhshi Saeid Pashazadeh |
author_sort | Hussein Alaa Al-Kabbi |
collection | DOAJ |
description | SMS spam is a pervasive issue that affects millions worldwide, leading to significant inconvenience, time wastage, and potential financial scams. Given the prevalence and potential harm, accurate and real-time detection of SMS spam is crucial. This paper proposes a novel approach to SMS spam detection involving five steps: preprocessing, feature extraction, feature fusion, feature selection, and classification. Our model is designed to simultaneously capture local, temporal, and global text message features using a hybrid deep learning model to enhance feature representation. We evaluated our model using the UCI dataset, comparing it with traditional and deep learning algorithms such as RF and BERT using cross-validation to ensure the robustness of our results. Our proposed method exhibited superior performance, achieving a good accuracy of 99.56%, surpassing other methods. The effectiveness of this method in SMS spam detection proved its potential for real-world implementation, where it could substantially mitigate the prevalence and impact of SMS spam. |
first_indexed | 2024-03-11T11:42:51Z |
format | Article |
id | doaj.art-c068487d0cef472ab5f1412957425cea |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-03-11T11:42:51Z |
publishDate | 2023-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-c068487d0cef472ab5f1412957425cea2023-11-10T00:00:59ZengIEEEIEEE Access2169-35362023-01-011112375612376510.1109/ACCESS.2023.332789710296923Multi-Type Feature Extraction and Early Fusion Framework for SMS Spam DetectionHussein Alaa Al-Kabbi0https://orcid.org/0000-0001-9230-7947Mohammad-Reza Feizi-Derakhshi1https://orcid.org/0000-0002-8548-976XSaeid Pashazadeh2https://orcid.org/0000-0002-8949-9180Department of Computer Engineering, Computerized Intelligence Systems Laboratory, University of Tabriz, Tabriz, IranDepartment of Computer Engineering, Computerized Intelligence Systems Laboratory, University of Tabriz, Tabriz, IranDepartment of Computer Engineering, University of Tabriz, Tabriz, IranSMS spam is a pervasive issue that affects millions worldwide, leading to significant inconvenience, time wastage, and potential financial scams. Given the prevalence and potential harm, accurate and real-time detection of SMS spam is crucial. This paper proposes a novel approach to SMS spam detection involving five steps: preprocessing, feature extraction, feature fusion, feature selection, and classification. Our model is designed to simultaneously capture local, temporal, and global text message features using a hybrid deep learning model to enhance feature representation. We evaluated our model using the UCI dataset, comparing it with traditional and deep learning algorithms such as RF and BERT using cross-validation to ensure the robustness of our results. Our proposed method exhibited superior performance, achieving a good accuracy of 99.56%, surpassing other methods. The effectiveness of this method in SMS spam detection proved its potential for real-world implementation, where it could substantially mitigate the prevalence and impact of SMS spam.https://ieeexplore.ieee.org/document/10296923/CNNdata fusiondeep learningLSTMSMS spam detection |
spellingShingle | Hussein Alaa Al-Kabbi Mohammad-Reza Feizi-Derakhshi Saeid Pashazadeh Multi-Type Feature Extraction and Early Fusion Framework for SMS Spam Detection IEEE Access CNN data fusion deep learning LSTM SMS spam detection |
title | Multi-Type Feature Extraction and Early Fusion Framework for SMS Spam Detection |
title_full | Multi-Type Feature Extraction and Early Fusion Framework for SMS Spam Detection |
title_fullStr | Multi-Type Feature Extraction and Early Fusion Framework for SMS Spam Detection |
title_full_unstemmed | Multi-Type Feature Extraction and Early Fusion Framework for SMS Spam Detection |
title_short | Multi-Type Feature Extraction and Early Fusion Framework for SMS Spam Detection |
title_sort | multi type feature extraction and early fusion framework for sms spam detection |
topic | CNN data fusion deep learning LSTM SMS spam detection |
url | https://ieeexplore.ieee.org/document/10296923/ |
work_keys_str_mv | AT husseinalaaalkabbi multitypefeatureextractionandearlyfusionframeworkforsmsspamdetection AT mohammadrezafeiziderakhshi multitypefeatureextractionandearlyfusionframeworkforsmsspamdetection AT saeidpashazadeh multitypefeatureextractionandearlyfusionframeworkforsmsspamdetection |