Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review

 The rapid growth of social media platforms has led to an increase in hate speech. This has prompted the development of effective detection mechanisms that aim to mitigate the potential hazards and threats it poses to society. BERT (Bidirectional Encoder Representations from Transformers) has produ...

Full description

Bibliographic Details
Main Authors: Jinan Aljawazeri, Mahdi Nsaif Jasim
Format: Article
Language:English
Published: College of Education, Al-Iraqia University 2024-03-01
Series:Iraqi Journal for Computer Science and Mathematics
Subjects:
Online Access:http://journal.esj.edu.iq/index.php/IJCM/article/view/917
Description
Summary: The rapid growth of social media platforms has led to an increase in hate speech. This has prompted the development of effective detection mechanisms that aim to mitigate the potential hazards and threats it poses to society. BERT (Bidirectional Encoder Representations from Transformers) has produced cutting-edge results in this field. This review paper aims to identify and analyze the whole process of using the BERT model to tackle the challenges associated with the hate speech detection problem. This academic discussion will begin by addressing the training datasets and the preprocessing methods involved. Subsequently, the use of the BERT model will be explored, followed by an examination of the contributions made to address the issues encountered. Finally, we will discuss the evaluation phase. The use of BERT included the application of two primary approaches. In the featurebased approach, BERT accepts textual input and generates its corresponding representation as output. The resulting output is then used as input for any classification model. The second approach involves the process of fine-tuning BERT using labeled datasets and then employing it directly for classification purposes. The controversial issues and open challenges that appeared at each stage were discussed. The results indicate that in both approaches, BERT has shown its efficacy relative to other models under contention. However, there is a need for greater attention and advancement to effectively solve the existing issues and constraints in the future.
ISSN:2958-0544
2788-7421