A new approach to filtering spam SMS: Motif Patterns
Along with the widespread of every technology, it comes with many problems. Mobile Short Message Service (SMS), which is widely used in mobile technologies, has brought many problems. The most important problem of SMS is unwanted messages named spam that are spread on the mobile network. Spam messa...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Gazi University
2018-06-01
|
Series: | Gazi Üniversitesi Fen Bilimleri Dergisi |
Subjects: | |
Online Access: | http://dergipark.gov.tr/download/article-file/476007 |
Summary: | Along with the widespread of every technology, it comes with many problems. Mobile Short Message Service (SMS), which is widely used in mobile technologies, has brought many
problems. The most important problem of SMS is unwanted messages named spam that are spread on the mobile network. Spam messages prevent mobile traffic and keep people busy
unnecessarily. In this study to filter SMS spam, a novel feature extraction method, motif pattern method, is proposed, which uses forms that composed of comparision on UTF-8 codes of characters. In the proposed motif pattern method, the appearance of the values entered into a window size (PB) defined on the unicodes of SMS is considered as a motif pattern. The frequencies of these motifs in the SMS are used as the feature vector. The motif types depend on the specified PB. Three benchmark datasets were used to test the motif pattern method. The success rate was 93.76%, 90.07% and 94.29%, respectively, for three sets of data. According to the observed results, it is seen that the proposed method is a successful feature extraction method from SMS messages in spam filtering. It is also thought that the motif method can be used in other text mining, natural language processing fields |
---|---|
ISSN: | 2147-9526 2147-9526 |