Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review

The rapid growth of social media platforms has led to an increase in hate speech. This has prompted the development of effective detection mechanisms that aim to mitigate the potential hazards and threats it poses to society. BERT (Bidirectional Encoder Representations from Transformers) has produ...

Full description

Bibliographic Details
Main Authors:	Jinan Aljawazeri, Mahdi Nsaif Jasim
Format:	Article
Language:	English
Published:	College of Education, Al-Iraqia University 2024-03-01
Series:	Iraqi Journal for Computer Science and Mathematics
Subjects:	Hate Speech Detection BERT Feature Extraction Finetuning challenges
Online Access:	http://journal.esj.edu.iq/index.php/IJCM/article/view/917

_version_	1827316970974871552
author	Jinan Aljawazeri Mahdi Nsaif Jasim
author_facet	Jinan Aljawazeri Mahdi Nsaif Jasim
author_sort	Jinan Aljawazeri
collection	DOAJ
description	The rapid growth of social media platforms has led to an increase in hate speech. This has prompted the development of effective detection mechanisms that aim to mitigate the potential hazards and threats it poses to society. BERT (Bidirectional Encoder Representations from Transformers) has produced cutting-edge results in this field. This review paper aims to identify and analyze the whole process of using the BERT model to tackle the challenges associated with the hate speech detection problem. This academic discussion will begin by addressing the training datasets and the preprocessing methods involved. Subsequently, the use of the BERT model will be explored, followed by an examination of the contributions made to address the issues encountered. Finally, we will discuss the evaluation phase. The use of BERT included the application of two primary approaches. In the featurebased approach, BERT accepts textual input and generates its corresponding representation as output. The resulting output is then used as input for any classification model. The second approach involves the process of fine-tuning BERT using labeled datasets and then employing it directly for classification purposes. The controversial issues and open challenges that appeared at each stage were discussed. The results indicate that in both approaches, BERT has shown its efficacy relative to other models under contention. However, there is a need for greater attention and advancement to effectively solve the existing issues and constraints in the future.
first_indexed	2024-04-24T23:24:37Z
format	Article
id	doaj.art-2747fb7253054fbcb82857578da9cf61
institution	Directory Open Access Journal
issn	2958-0544 2788-7421
language	English
last_indexed	2024-04-24T23:24:37Z
publishDate	2024-03-01
publisher	College of Education, Al-Iraqia University
record_format	Article
series	Iraqi Journal for Computer Science and Mathematics
spelling	doaj.art-2747fb7253054fbcb82857578da9cf612024-03-16T02:41:27ZengCollege of Education, Al-Iraqia UniversityIraqi Journal for Computer Science and Mathematics2958-05442788-74212024-03-015210.52866/ijcsm.2024.05.02.001Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review Jinan Aljawazeri0Mahdi Nsaif Jasim1University of BabylonUniversity of Information Technology and Communications The rapid growth of social media platforms has led to an increase in hate speech. This has prompted the development of effective detection mechanisms that aim to mitigate the potential hazards and threats it poses to society. BERT (Bidirectional Encoder Representations from Transformers) has produced cutting-edge results in this field. This review paper aims to identify and analyze the whole process of using the BERT model to tackle the challenges associated with the hate speech detection problem. This academic discussion will begin by addressing the training datasets and the preprocessing methods involved. Subsequently, the use of the BERT model will be explored, followed by an examination of the contributions made to address the issues encountered. Finally, we will discuss the evaluation phase. The use of BERT included the application of two primary approaches. In the featurebased approach, BERT accepts textual input and generates its corresponding representation as output. The resulting output is then used as input for any classification model. The second approach involves the process of fine-tuning BERT using labeled datasets and then employing it directly for classification purposes. The controversial issues and open challenges that appeared at each stage were discussed. The results indicate that in both approaches, BERT has shown its efficacy relative to other models under contention. However, there is a need for greater attention and advancement to effectively solve the existing issues and constraints in the future. http://journal.esj.edu.iq/index.php/IJCM/article/view/917Hate Speech DetectionBERTFeature ExtractionFinetuningchallenges
spellingShingle	Jinan Aljawazeri Mahdi Nsaif Jasim Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review Iraqi Journal for Computer Science and Mathematics Hate Speech Detection BERT Feature Extraction Finetuning challenges
title	Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review
title_full	Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review
title_fullStr	Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review
title_full_unstemmed	Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review
title_short	Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review
title_sort	addressing challenges in hate speech detection using bert based models a review
topic	Hate Speech Detection BERT Feature Extraction Finetuning challenges
url	http://journal.esj.edu.iq/index.php/IJCM/article/view/917
work_keys_str_mv	AT jinanaljawazeri addressingchallengesinhatespeechdetectionusingbertbasedmodelsareview AT mahdinsaifjasim addressingchallengesinhatespeechdetectionusingbertbasedmodelsareview

Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review

Similar Items