BREE-HD: A Transformer-Based Model to Identify Threats on Twitter

With the world transitioning to an online reality and a surge in social media users, detecting online harassment and threats has become more pressing than ever. Gendered cyber-hate causes women significant social, psychological, reputational, economic, and political harm. To tackle this problem, we...

Full description

Bibliographic Details
Main Authors: Sinchana Kumbale, Smriti Singh, G. Poornalatha, Sanjay Singh
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10168907/
_version_ 1797783599412412416
author Sinchana Kumbale
Smriti Singh
G. Poornalatha
Sanjay Singh
author_facet Sinchana Kumbale
Smriti Singh
G. Poornalatha
Sanjay Singh
author_sort Sinchana Kumbale
collection DOAJ
description With the world transitioning to an online reality and a surge in social media users, detecting online harassment and threats has become more pressing than ever. Gendered cyber-hate causes women significant social, psychological, reputational, economic, and political harm. To tackle this problem, we develop a dataset and propose a transformer-based model to classify tweets into threats or non-threats that are either sexist or non-sexist. We have developed a model to identify sexist and non-sexist threats from a collection of sexist, non-sexist tweets. BREE-HD performs extraordinarily well with an accuracy of 97% when trained on the dataset we developed to detect threats from a collection of derogatory tweets. To provide insight into how BREE-HD makes classifications, we apply explainable A.I. (XAI) concepts to provide a detailed qualitative analysis of our proposed methodology. As an extension of our work, BREE-HD could be used as a part of a system that could detect threats targeting people specifically tailored to classify them in real-time adequately.
first_indexed 2024-03-13T00:29:07Z
format Article
id doaj.art-83e77dd9f0c741a1a1b924e63de503df
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-13T00:29:07Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-83e77dd9f0c741a1a1b924e63de503df2023-07-10T23:00:18ZengIEEEIEEE Access2169-35362023-01-0111671806719010.1109/ACCESS.2023.329107210168907BREE-HD: A Transformer-Based Model to Identify Threats on TwitterSinchana Kumbale0https://orcid.org/0009-0003-8989-1769Smriti Singh1https://orcid.org/0009-0001-6479-3308G. Poornalatha2Sanjay Singh3https://orcid.org/0000-0001-7212-5919Department of Computer Science and Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, IndiaDepartment of Computer Science, The University of Texas at Austin, Austin, TX, USADepartment of Information and Communication Technology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, IndiaDepartment of Information and Communication Technology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, IndiaWith the world transitioning to an online reality and a surge in social media users, detecting online harassment and threats has become more pressing than ever. Gendered cyber-hate causes women significant social, psychological, reputational, economic, and political harm. To tackle this problem, we develop a dataset and propose a transformer-based model to classify tweets into threats or non-threats that are either sexist or non-sexist. We have developed a model to identify sexist and non-sexist threats from a collection of sexist, non-sexist tweets. BREE-HD performs extraordinarily well with an accuracy of 97% when trained on the dataset we developed to detect threats from a collection of derogatory tweets. To provide insight into how BREE-HD makes classifications, we apply explainable A.I. (XAI) concepts to provide a detailed qualitative analysis of our proposed methodology. As an extension of our work, BREE-HD could be used as a part of a system that could detect threats targeting people specifically tailored to classify them in real-time adequately.https://ieeexplore.ieee.org/document/10168907/Explainable AIhate speech detectionsexism detectionthreat detectiontransformers
spellingShingle Sinchana Kumbale
Smriti Singh
G. Poornalatha
Sanjay Singh
BREE-HD: A Transformer-Based Model to Identify Threats on Twitter
IEEE Access
Explainable AI
hate speech detection
sexism detection
threat detection
transformers
title BREE-HD: A Transformer-Based Model to Identify Threats on Twitter
title_full BREE-HD: A Transformer-Based Model to Identify Threats on Twitter
title_fullStr BREE-HD: A Transformer-Based Model to Identify Threats on Twitter
title_full_unstemmed BREE-HD: A Transformer-Based Model to Identify Threats on Twitter
title_short BREE-HD: A Transformer-Based Model to Identify Threats on Twitter
title_sort bree hd a transformer based model to identify threats on twitter
topic Explainable AI
hate speech detection
sexism detection
threat detection
transformers
url https://ieeexplore.ieee.org/document/10168907/
work_keys_str_mv AT sinchanakumbale breehdatransformerbasedmodeltoidentifythreatsontwitter
AT smritisingh breehdatransformerbasedmodeltoidentifythreatsontwitter
AT gpoornalatha breehdatransformerbasedmodeltoidentifythreatsontwitter
AT sanjaysingh breehdatransformerbasedmodeltoidentifythreatsontwitter