Target-Oriented Investigation of Online Abusive Attacks: A Dataset and Analysis

Despite a body of research revolving around online abusive language, aiming at different objectives such as detection, diffusion prediction, and mitigation, existing research has seldom looked at factors motivating this behaviour. To further research in this direction, we investigate the motivations...

Full description

Bibliographic Details
Main Authors: Raneem Alharthi, Rajwa Alharthi, Ravi Shekhar, Arkaitz Zubiaga
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10160004/
_version_ 1797790952647032832
author Raneem Alharthi
Rajwa Alharthi
Ravi Shekhar
Arkaitz Zubiaga
author_facet Raneem Alharthi
Rajwa Alharthi
Ravi Shekhar
Arkaitz Zubiaga
author_sort Raneem Alharthi
collection DOAJ
description Despite a body of research revolving around online abusive language, aiming at different objectives such as detection, diffusion prediction, and mitigation, existing research has seldom looked at factors motivating this behaviour. To further research in this direction, we investigate the motivations behind online abuse by looking at the characteristics of the targets of such abuse, i.e. is the abuse more prominent for specific characteristics of the targets? To enable target-oriented research into online abuse, we introduce the Online Abusive Attacks (OAA) dataset, the first benchmark dataset providing a holistic view of online abusive attacks, including social media profile data and metadata for both targets and perpetrators, in addition to context. The dataset contains 2.3K Twitter accounts, 5M tweets, and 106.9K categorised conversations. Further, we conduct an in-depth statistical analysis of online abuse centred around the targets’ characteristics. We identify two types of abusive attacks: those motivated by characteristics of the targets (identity-based attacks) and others (behavioural attacks). We find that online abusive attacks are predominantly motivated by the targets’ identities (97%), behavioural attacks accounting for a much smaller proportion (3%). Abuse is also more likely to target users who are popular and have a verified status. Interestingly, an analysis of the user bios shows no clear indication that keywords used in the bios are likely to trigger abuse. Additionally, we also look at the frequency with which perpetrators perform online abusive attacks. Our analysis shows a large number of infrequent perpetrators, with only a few recurrent perpetrators. Findings from our study have important implications for the development of abusive language detection models that incorporate an awareness of the targets to improve their potential for prediction.
first_indexed 2024-03-13T02:11:51Z
format Article
id doaj.art-3dc717974a1b4d01a5d1e795648fcc68
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-13T02:11:51Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-3dc717974a1b4d01a5d1e795648fcc682023-06-30T23:01:18ZengIEEEIEEE Access2169-35362023-01-0111641146412710.1109/ACCESS.2023.328914810160004Target-Oriented Investigation of Online Abusive Attacks: A Dataset and AnalysisRaneem Alharthi0https://orcid.org/0000-0001-8749-7352Rajwa Alharthi1Ravi Shekhar2Arkaitz Zubiaga3https://orcid.org/0000-0003-4583-3623College of Computers and Information Technology, Taif University, Ta’if, Saudi ArabiaCollege of Computers and Information Technology, Taif University, Ta’if, Saudi ArabiaSchool of Computer Science and Electronic Engineering, University of Essex, Colchester, U.KSchool of Electronic Engineering and Computer Science, Queen Mary University of London, London, U.KDespite a body of research revolving around online abusive language, aiming at different objectives such as detection, diffusion prediction, and mitigation, existing research has seldom looked at factors motivating this behaviour. To further research in this direction, we investigate the motivations behind online abuse by looking at the characteristics of the targets of such abuse, i.e. is the abuse more prominent for specific characteristics of the targets? To enable target-oriented research into online abuse, we introduce the Online Abusive Attacks (OAA) dataset, the first benchmark dataset providing a holistic view of online abusive attacks, including social media profile data and metadata for both targets and perpetrators, in addition to context. The dataset contains 2.3K Twitter accounts, 5M tweets, and 106.9K categorised conversations. Further, we conduct an in-depth statistical analysis of online abuse centred around the targets’ characteristics. We identify two types of abusive attacks: those motivated by characteristics of the targets (identity-based attacks) and others (behavioural attacks). We find that online abusive attacks are predominantly motivated by the targets’ identities (97%), behavioural attacks accounting for a much smaller proportion (3%). Abuse is also more likely to target users who are popular and have a verified status. Interestingly, an analysis of the user bios shows no clear indication that keywords used in the bios are likely to trigger abuse. Additionally, we also look at the frequency with which perpetrators perform online abusive attacks. Our analysis shows a large number of infrequent perpetrators, with only a few recurrent perpetrators. Findings from our study have important implications for the development of abusive language detection models that incorporate an awareness of the targets to improve their potential for prediction.https://ieeexplore.ieee.org/document/10160004/Abusive languageonline hatetargets characteristics of online abusesocial network abuseonline abusive attacks dataset
spellingShingle Raneem Alharthi
Rajwa Alharthi
Ravi Shekhar
Arkaitz Zubiaga
Target-Oriented Investigation of Online Abusive Attacks: A Dataset and Analysis
IEEE Access
Abusive language
online hate
targets characteristics of online abuse
social network abuse
online abusive attacks dataset
title Target-Oriented Investigation of Online Abusive Attacks: A Dataset and Analysis
title_full Target-Oriented Investigation of Online Abusive Attacks: A Dataset and Analysis
title_fullStr Target-Oriented Investigation of Online Abusive Attacks: A Dataset and Analysis
title_full_unstemmed Target-Oriented Investigation of Online Abusive Attacks: A Dataset and Analysis
title_short Target-Oriented Investigation of Online Abusive Attacks: A Dataset and Analysis
title_sort target oriented investigation of online abusive attacks a dataset and analysis
topic Abusive language
online hate
targets characteristics of online abuse
social network abuse
online abusive attacks dataset
url https://ieeexplore.ieee.org/document/10160004/
work_keys_str_mv AT raneemalharthi targetorientedinvestigationofonlineabusiveattacksadatasetandanalysis
AT rajwaalharthi targetorientedinvestigationofonlineabusiveattacksadatasetandanalysis
AT ravishekhar targetorientedinvestigationofonlineabusiveattacksadatasetandanalysis
AT arkaitzzubiaga targetorientedinvestigationofonlineabusiveattacksadatasetandanalysis