Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review

 The rapid growth of social media platforms has led to an increase in hate speech. This has prompted the development of effective detection mechanisms that aim to mitigate the potential hazards and threats it poses to society. BERT (Bidirectional Encoder Representations from Transformers) has produ...

Full description

Bibliographic Details
Main Authors: Jinan Aljawazeri, Mahdi Nsaif Jasim
Format: Article
Language:English
Published: College of Education, Al-Iraqia University 2024-03-01
Series:Iraqi Journal for Computer Science and Mathematics
Subjects:
Online Access:http://journal.esj.edu.iq/index.php/IJCM/article/view/917
_version_ 1827316970974871552
author Jinan Aljawazeri
Mahdi Nsaif Jasim
author_facet Jinan Aljawazeri
Mahdi Nsaif Jasim
author_sort Jinan Aljawazeri
collection DOAJ
description  The rapid growth of social media platforms has led to an increase in hate speech. This has prompted the development of effective detection mechanisms that aim to mitigate the potential hazards and threats it poses to society. BERT (Bidirectional Encoder Representations from Transformers) has produced cutting-edge results in this field. This review paper aims to identify and analyze the whole process of using the BERT model to tackle the challenges associated with the hate speech detection problem. This academic discussion will begin by addressing the training datasets and the preprocessing methods involved. Subsequently, the use of the BERT model will be explored, followed by an examination of the contributions made to address the issues encountered. Finally, we will discuss the evaluation phase. The use of BERT included the application of two primary approaches. In the featurebased approach, BERT accepts textual input and generates its corresponding representation as output. The resulting output is then used as input for any classification model. The second approach involves the process of fine-tuning BERT using labeled datasets and then employing it directly for classification purposes. The controversial issues and open challenges that appeared at each stage were discussed. The results indicate that in both approaches, BERT has shown its efficacy relative to other models under contention. However, there is a need for greater attention and advancement to effectively solve the existing issues and constraints in the future.
first_indexed 2024-04-24T23:24:37Z
format Article
id doaj.art-2747fb7253054fbcb82857578da9cf61
institution Directory Open Access Journal
issn 2958-0544
2788-7421
language English
last_indexed 2024-04-24T23:24:37Z
publishDate 2024-03-01
publisher College of Education, Al-Iraqia University
record_format Article
series Iraqi Journal for Computer Science and Mathematics
spelling doaj.art-2747fb7253054fbcb82857578da9cf612024-03-16T02:41:27ZengCollege of Education, Al-Iraqia UniversityIraqi Journal for Computer Science and Mathematics2958-05442788-74212024-03-015210.52866/ijcsm.2024.05.02.001Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review Jinan Aljawazeri0Mahdi Nsaif Jasim1University of BabylonUniversity of Information Technology and Communications  The rapid growth of social media platforms has led to an increase in hate speech. This has prompted the development of effective detection mechanisms that aim to mitigate the potential hazards and threats it poses to society. BERT (Bidirectional Encoder Representations from Transformers) has produced cutting-edge results in this field. This review paper aims to identify and analyze the whole process of using the BERT model to tackle the challenges associated with the hate speech detection problem. This academic discussion will begin by addressing the training datasets and the preprocessing methods involved. Subsequently, the use of the BERT model will be explored, followed by an examination of the contributions made to address the issues encountered. Finally, we will discuss the evaluation phase. The use of BERT included the application of two primary approaches. In the featurebased approach, BERT accepts textual input and generates its corresponding representation as output. The resulting output is then used as input for any classification model. The second approach involves the process of fine-tuning BERT using labeled datasets and then employing it directly for classification purposes. The controversial issues and open challenges that appeared at each stage were discussed. The results indicate that in both approaches, BERT has shown its efficacy relative to other models under contention. However, there is a need for greater attention and advancement to effectively solve the existing issues and constraints in the future. http://journal.esj.edu.iq/index.php/IJCM/article/view/917Hate Speech DetectionBERTFeature ExtractionFinetuningchallenges
spellingShingle Jinan Aljawazeri
Mahdi Nsaif Jasim
Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review
Iraqi Journal for Computer Science and Mathematics
Hate Speech Detection
BERT
Feature Extraction
Finetuning
challenges
title Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review
title_full Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review
title_fullStr Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review
title_full_unstemmed Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review
title_short Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review
title_sort addressing challenges in hate speech detection using bert based models a review
topic Hate Speech Detection
BERT
Feature Extraction
Finetuning
challenges
url http://journal.esj.edu.iq/index.php/IJCM/article/view/917
work_keys_str_mv AT jinanaljawazeri addressingchallengesinhatespeechdetectionusingbertbasedmodelsareview
AT mahdinsaifjasim addressingchallengesinhatespeechdetectionusingbertbasedmodelsareview