Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review
The rapid growth of social media platforms has led to an increase in hate speech. This has prompted the development of effective detection mechanisms that aim to mitigate the potential hazards and threats it poses to society. BERT (Bidirectional Encoder Representations from Transformers) has produ...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
College of Education, Al-Iraqia University
2024-03-01
|
Series: | Iraqi Journal for Computer Science and Mathematics |
Subjects: | |
Online Access: | http://journal.esj.edu.iq/index.php/IJCM/article/view/917 |
_version_ | 1827316970974871552 |
---|---|
author | Jinan Aljawazeri Mahdi Nsaif Jasim |
author_facet | Jinan Aljawazeri Mahdi Nsaif Jasim |
author_sort | Jinan Aljawazeri |
collection | DOAJ |
description |
The rapid growth of social media platforms has led to an increase in hate speech. This has prompted
the development of effective detection mechanisms that aim to mitigate the potential hazards and threats it poses to
society. BERT (Bidirectional Encoder Representations from Transformers) has produced cutting-edge results in
this field. This review paper aims to identify and analyze the whole process of using the BERT model to tackle the
challenges associated with the hate speech detection problem. This academic discussion will begin by addressing
the training datasets and the preprocessing methods involved. Subsequently, the use of the BERT model will be
explored, followed by an examination of the contributions made to address the issues encountered. Finally, we will
discuss the evaluation phase. The use of BERT included the application of two primary approaches. In the featurebased approach, BERT accepts textual input and generates its corresponding representation as output. The resulting
output is then used as input for any classification model. The second approach involves the process of fine-tuning
BERT using labeled datasets and then employing it directly for classification purposes. The controversial issues
and open challenges that appeared at each stage were discussed. The results indicate that in both approaches,
BERT has shown its efficacy relative to other models under contention. However, there is a need for greater
attention and advancement to effectively solve the existing issues and constraints in the future.
|
first_indexed | 2024-04-24T23:24:37Z |
format | Article |
id | doaj.art-2747fb7253054fbcb82857578da9cf61 |
institution | Directory Open Access Journal |
issn | 2958-0544 2788-7421 |
language | English |
last_indexed | 2024-04-24T23:24:37Z |
publishDate | 2024-03-01 |
publisher | College of Education, Al-Iraqia University |
record_format | Article |
series | Iraqi Journal for Computer Science and Mathematics |
spelling | doaj.art-2747fb7253054fbcb82857578da9cf612024-03-16T02:41:27ZengCollege of Education, Al-Iraqia UniversityIraqi Journal for Computer Science and Mathematics2958-05442788-74212024-03-015210.52866/ijcsm.2024.05.02.001Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review Jinan Aljawazeri0Mahdi Nsaif Jasim1University of BabylonUniversity of Information Technology and Communications The rapid growth of social media platforms has led to an increase in hate speech. This has prompted the development of effective detection mechanisms that aim to mitigate the potential hazards and threats it poses to society. BERT (Bidirectional Encoder Representations from Transformers) has produced cutting-edge results in this field. This review paper aims to identify and analyze the whole process of using the BERT model to tackle the challenges associated with the hate speech detection problem. This academic discussion will begin by addressing the training datasets and the preprocessing methods involved. Subsequently, the use of the BERT model will be explored, followed by an examination of the contributions made to address the issues encountered. Finally, we will discuss the evaluation phase. The use of BERT included the application of two primary approaches. In the featurebased approach, BERT accepts textual input and generates its corresponding representation as output. The resulting output is then used as input for any classification model. The second approach involves the process of fine-tuning BERT using labeled datasets and then employing it directly for classification purposes. The controversial issues and open challenges that appeared at each stage were discussed. The results indicate that in both approaches, BERT has shown its efficacy relative to other models under contention. However, there is a need for greater attention and advancement to effectively solve the existing issues and constraints in the future. http://journal.esj.edu.iq/index.php/IJCM/article/view/917Hate Speech DetectionBERTFeature ExtractionFinetuningchallenges |
spellingShingle | Jinan Aljawazeri Mahdi Nsaif Jasim Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review Iraqi Journal for Computer Science and Mathematics Hate Speech Detection BERT Feature Extraction Finetuning challenges |
title | Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review |
title_full | Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review |
title_fullStr | Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review |
title_full_unstemmed | Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review |
title_short | Addressing Challenges in Hate Speech Detection using BERT-based Models: A Review |
title_sort | addressing challenges in hate speech detection using bert based models a review |
topic | Hate Speech Detection BERT Feature Extraction Finetuning challenges |
url | http://journal.esj.edu.iq/index.php/IJCM/article/view/917 |
work_keys_str_mv | AT jinanaljawazeri addressingchallengesinhatespeechdetectionusingbertbasedmodelsareview AT mahdinsaifjasim addressingchallengesinhatespeechdetectionusingbertbasedmodelsareview |