A Hybrid CNN-LSTM Model for SMS Spam Detection in Arabic and English Messages

Despite the rapid evolution of Internet protocol-based messaging services, SMS still remains an indisputable communication service in our lives until today. For example, several businesses consider that text messages are more effective than e-mails. This is because 82% of SMSs are read within 5 min....

Full description

Bibliographic Details
Main Authors: Abdallah Ghourabi, Mahmood A. Mahmood, Qusay M. Alzubi
Format: Article
Language:English
Published: MDPI AG 2020-09-01
Series:Future Internet
Subjects:
Online Access:https://www.mdpi.com/1999-5903/12/9/156
Description
Summary:Despite the rapid evolution of Internet protocol-based messaging services, SMS still remains an indisputable communication service in our lives until today. For example, several businesses consider that text messages are more effective than e-mails. This is because 82% of SMSs are read within 5 min., but consumers only open one in four e-mails they receive. The importance of SMS for mobile phone users has attracted the attention of spammers. In fact, the volume of SMS spam has increased considerably in recent years with the emergence of new security threats, such as SMiShing. In this paper, we propose a hybrid deep learning model for detecting SMS spam messages. This detection model is based on the combination of two deep learning methods CNN and LSTM. It is intended to deal with mixed text messages that are written in Arabic or English. For the comparative evaluation, we also tested other well-known machine learning algorithms. The experimental results that we present in this paper show that our CNN-LSTM model outperforms the other algorithms. It achieved a very good accuracy of 98.37%.
ISSN:1999-5903