An Ensemble-Based Multi-Classification Machine Learning Classifiers Approach to Detect Multiple Classes of Cyberbullying

The impact of communication through social media is currently considered a significant social issue. This issue can lead to inappropriate behavior using social media, which is referred to as cyberbullying. Automated systems are capable of efficiently identifying cyberbullying and performing sentimen...

Full description

Bibliographic Details
Main Authors: Abdulkarim Faraj Alqahtani, Mohammad Ilyas
Format: Article
Language:English
Published: MDPI AG 2024-01-01
Series:Machine Learning and Knowledge Extraction
Subjects:
Online Access:https://www.mdpi.com/2504-4990/6/1/9
_version_ 1797240215628152832
author Abdulkarim Faraj Alqahtani
Mohammad Ilyas
author_facet Abdulkarim Faraj Alqahtani
Mohammad Ilyas
author_sort Abdulkarim Faraj Alqahtani
collection DOAJ
description The impact of communication through social media is currently considered a significant social issue. This issue can lead to inappropriate behavior using social media, which is referred to as cyberbullying. Automated systems are capable of efficiently identifying cyberbullying and performing sentiment analysis on social media platforms. This study focuses on enhancing a system to detect six types of cyberbullying tweets. Employing multi-classification algorithms on a cyberbullying dataset, our approach achieved high accuracy, particularly with the TF-IDF (bigram) feature extraction. Our experiment achieved high performance compared with that stated for previous experiments on the same dataset. Two ensemble machine learning methods, employing the N-gram with TF-IDF feature-extraction technique, demonstrated superior performance in classification. Three popular multi-classification algorithms: Decision Trees, Random Forest, and XGBoost, were combined into two varied ensemble methods separately. These ensemble classifiers demonstrated superior performance compared to traditional machine learning classifier models. The stacking classifier reached 90.71% accuracy and the voting classifier 90.44%. The results of the experiments showed that the framework can detect six different types of cyberbullying more efficiently, with an accuracy rate of 0.9071.
first_indexed 2024-04-24T18:03:54Z
format Article
id doaj.art-6dd7eb5102114877beafec7a371eba64
institution Directory Open Access Journal
issn 2504-4990
language English
last_indexed 2024-04-24T18:03:54Z
publishDate 2024-01-01
publisher MDPI AG
record_format Article
series Machine Learning and Knowledge Extraction
spelling doaj.art-6dd7eb5102114877beafec7a371eba642024-03-27T13:52:04ZengMDPI AGMachine Learning and Knowledge Extraction2504-49902024-01-016115617010.3390/make6010009An Ensemble-Based Multi-Classification Machine Learning Classifiers Approach to Detect Multiple Classes of CyberbullyingAbdulkarim Faraj Alqahtani0Mohammad Ilyas1Electrical Engineering and Computer Science, Florida Atlantic University, 777 Glades Road, Boca Raton, FL 33431, USAElectrical Engineering and Computer Science, Florida Atlantic University, 777 Glades Road, Boca Raton, FL 33431, USAThe impact of communication through social media is currently considered a significant social issue. This issue can lead to inappropriate behavior using social media, which is referred to as cyberbullying. Automated systems are capable of efficiently identifying cyberbullying and performing sentiment analysis on social media platforms. This study focuses on enhancing a system to detect six types of cyberbullying tweets. Employing multi-classification algorithms on a cyberbullying dataset, our approach achieved high accuracy, particularly with the TF-IDF (bigram) feature extraction. Our experiment achieved high performance compared with that stated for previous experiments on the same dataset. Two ensemble machine learning methods, employing the N-gram with TF-IDF feature-extraction technique, demonstrated superior performance in classification. Three popular multi-classification algorithms: Decision Trees, Random Forest, and XGBoost, were combined into two varied ensemble methods separately. These ensemble classifiers demonstrated superior performance compared to traditional machine learning classifier models. The stacking classifier reached 90.71% accuracy and the voting classifier 90.44%. The results of the experiments showed that the framework can detect six different types of cyberbullying more efficiently, with an accuracy rate of 0.9071.https://www.mdpi.com/2504-4990/6/1/9ensemble modelscyberbullyingmulti-classificationmulticlassTF-IDFN-gram
spellingShingle Abdulkarim Faraj Alqahtani
Mohammad Ilyas
An Ensemble-Based Multi-Classification Machine Learning Classifiers Approach to Detect Multiple Classes of Cyberbullying
Machine Learning and Knowledge Extraction
ensemble models
cyberbullying
multi-classification
multiclass
TF-IDF
N-gram
title An Ensemble-Based Multi-Classification Machine Learning Classifiers Approach to Detect Multiple Classes of Cyberbullying
title_full An Ensemble-Based Multi-Classification Machine Learning Classifiers Approach to Detect Multiple Classes of Cyberbullying
title_fullStr An Ensemble-Based Multi-Classification Machine Learning Classifiers Approach to Detect Multiple Classes of Cyberbullying
title_full_unstemmed An Ensemble-Based Multi-Classification Machine Learning Classifiers Approach to Detect Multiple Classes of Cyberbullying
title_short An Ensemble-Based Multi-Classification Machine Learning Classifiers Approach to Detect Multiple Classes of Cyberbullying
title_sort ensemble based multi classification machine learning classifiers approach to detect multiple classes of cyberbullying
topic ensemble models
cyberbullying
multi-classification
multiclass
TF-IDF
N-gram
url https://www.mdpi.com/2504-4990/6/1/9
work_keys_str_mv AT abdulkarimfarajalqahtani anensemblebasedmulticlassificationmachinelearningclassifiersapproachtodetectmultipleclassesofcyberbullying
AT mohammadilyas anensemblebasedmulticlassificationmachinelearningclassifiersapproachtodetectmultipleclassesofcyberbullying
AT abdulkarimfarajalqahtani ensemblebasedmulticlassificationmachinelearningclassifiersapproachtodetectmultipleclassesofcyberbullying
AT mohammadilyas ensemblebasedmulticlassificationmachinelearningclassifiersapproachtodetectmultipleclassesofcyberbullying