Distributed layer-3 e-mail classification for spam control

This paper proposes a distributed layer-3 e-mail classification for spam control. E-mail packets are inferred in transit and tagged with an intra-packet spam score to indicate whether the packet forms a legitimate or spam e-mail. During e-mail packet reassembly, tags for an e-mail are aggregated to...

Full description

Bibliographic Details
Main Authors: Marsono, Muhammad N., El-Kharashi, M. Watheq, Gebali, Fayez, Ganti, Sudhakar
Format: Book Section
Published: IEEE Explore 2007
Subjects:
_version_ 1796855648640565248
author Marsono, Muhammad N.
El-Kharashi, M. Watheq
Gebali, Fayez
Ganti, Sudhakar
author_facet Marsono, Muhammad N.
El-Kharashi, M. Watheq
Gebali, Fayez
Ganti, Sudhakar
author_sort Marsono, Muhammad N.
collection ePrints
description This paper proposes a distributed layer-3 e-mail classification for spam control. E-mail packets are inferred in transit and tagged with an intra-packet spam score to indicate whether the packet forms a legitimate or spam e-mail. During e-mail packet reassembly, tags for an e-mail are aggregated to give an inter-packet spam score. The naive Bayes inference technique is used to evaluate the performance of the proposed approach compared to the full e-mail classification approach. Our simulation results show that the proposed approach exhibits a comparable spam precision (and confidence) to the full e-mail classification approach. Spam recall increases from 63% to 85% depending to the maximum transmission unit size, approaching the 87% of the full e-mail classification. For 67% spam-to-legitimate ratio, we obtain reduction of end servers's workload by 42% to 57% (across all maximum transmission unit sizes tested) of the total e-mail traffic. Thus, the proposed approach can complement existing anti-spam systems by pre-processing e-mail packets on upstream nodes. Layer-3 e-mail processing requires reduced processing complexity as compared to layer-7 processing and is viable for high throughput hardware-based implementations.
first_indexed 2024-03-05T18:31:39Z
format Book Section
id utm.eprints-17110
institution Universiti Teknologi Malaysia - ePrints
last_indexed 2024-03-05T18:31:39Z
publishDate 2007
publisher IEEE Explore
record_format dspace
spelling utm.eprints-171102017-02-05T03:12:26Z http://eprints.utm.my/17110/ Distributed layer-3 e-mail classification for spam control Marsono, Muhammad N. El-Kharashi, M. Watheq Gebali, Fayez Ganti, Sudhakar TK Electrical engineering. Electronics Nuclear engineering This paper proposes a distributed layer-3 e-mail classification for spam control. E-mail packets are inferred in transit and tagged with an intra-packet spam score to indicate whether the packet forms a legitimate or spam e-mail. During e-mail packet reassembly, tags for an e-mail are aggregated to give an inter-packet spam score. The naive Bayes inference technique is used to evaluate the performance of the proposed approach compared to the full e-mail classification approach. Our simulation results show that the proposed approach exhibits a comparable spam precision (and confidence) to the full e-mail classification approach. Spam recall increases from 63% to 85% depending to the maximum transmission unit size, approaching the 87% of the full e-mail classification. For 67% spam-to-legitimate ratio, we obtain reduction of end servers's workload by 42% to 57% (across all maximum transmission unit sizes tested) of the total e-mail traffic. Thus, the proposed approach can complement existing anti-spam systems by pre-processing e-mail packets on upstream nodes. Layer-3 e-mail processing requires reduced processing complexity as compared to layer-7 processing and is viable for high throughput hardware-based implementations. IEEE Explore 2007-01-15 Book Section PeerReviewed Marsono, Muhammad N. and El-Kharashi, M. Watheq and Gebali, Fayez and Ganti, Sudhakar (2007) Distributed layer-3 e-mail classification for spam control. In: Electrical and Computer Engineering, 2006. CCECE '06. Canadian Conference on. IEEE Explore, Ottawa, Ont., pp. 742-745. ISBN 1-4244-0038-4 http://dx.doi.org/10.1109/CCECE.2006.277810 10.1109/CCECE.2006.277810
spellingShingle TK Electrical engineering. Electronics Nuclear engineering
Marsono, Muhammad N.
El-Kharashi, M. Watheq
Gebali, Fayez
Ganti, Sudhakar
Distributed layer-3 e-mail classification for spam control
title Distributed layer-3 e-mail classification for spam control
title_full Distributed layer-3 e-mail classification for spam control
title_fullStr Distributed layer-3 e-mail classification for spam control
title_full_unstemmed Distributed layer-3 e-mail classification for spam control
title_short Distributed layer-3 e-mail classification for spam control
title_sort distributed layer 3 e mail classification for spam control
topic TK Electrical engineering. Electronics Nuclear engineering
work_keys_str_mv AT marsonomuhammadn distributedlayer3emailclassificationforspamcontrol
AT elkharashimwatheq distributedlayer3emailclassificationforspamcontrol
AT gebalifayez distributedlayer3emailclassificationforspamcontrol
AT gantisudhakar distributedlayer3emailclassificationforspamcontrol