Enhancing the Detection of Misogynistic Content in Social Media by Transferring Knowledge From Song Phrases
Misogyny is a serious social problem that affects the mental and physical health of women and can even lead to femicide. This problem is visible and prevalent in different communication channels, such as music and social networks, encouraging and reinforcing this harmful behavior. Given this situati...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2023-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10040492/ |
_version_ | 1811163515731312640 |
---|---|
author | Ricardo Calderon-Suarez Rosa M. Ortega-Mendoza Manuel Montes-Y-Gomez Carina Toxqui-Quitl Marco A. Marquez-Vera |
author_facet | Ricardo Calderon-Suarez Rosa M. Ortega-Mendoza Manuel Montes-Y-Gomez Carina Toxqui-Quitl Marco A. Marquez-Vera |
author_sort | Ricardo Calderon-Suarez |
collection | DOAJ |
description | Misogyny is a serious social problem that affects the mental and physical health of women and can even lead to femicide. This problem is visible and prevalent in different communication channels, such as music and social networks, encouraging and reinforcing this harmful behavior. Given this situation, the automatic detection of misogynistic content on social networks is a task of increasing interest. In this regard, most current computational approaches employ a supervised machine learning strategy. The main challenge is to capture the diversity and complexity of offensive language directed at women. Accordingly, the size and quality of training data play a fundamental role in the results of the methods. In this paper, we propose a novel data augmentation approach that takes advantage of song lyrics to increase the generalization capability of methods and improve their performance. Hence, we present a methodology for automatically compiling a corpus of song phrases that show abusive and explicit words against women. The proposed approach was evaluated using English and Spanish benchmark datasets, obtaining results that outperform conventional transfer learning techniques and achieve high competitiveness compared with state-of-the-art methods. |
first_indexed | 2024-04-10T15:07:32Z |
format | Article |
id | doaj.art-79c762e6fe7a427e9ad4fb7b25a87645 |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-04-10T15:07:32Z |
publishDate | 2023-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-79c762e6fe7a427e9ad4fb7b25a876452023-02-15T00:00:49ZengIEEEIEEE Access2169-35362023-01-0111131791319010.1109/ACCESS.2023.324296510040492Enhancing the Detection of Misogynistic Content in Social Media by Transferring Knowledge From Song PhrasesRicardo Calderon-Suarez0Rosa M. Ortega-Mendoza1https://orcid.org/0000-0002-3591-6567Manuel Montes-Y-Gomez2Carina Toxqui-Quitl3Marco A. Marquez-Vera4División de Investigación y Posgrado, Universidad Politécnica de Tulancingo (UPT), Tulancingo, Hidalgo, MexicoDivisión de Investigación y Posgrado, Universidad Politécnica de Tulancingo (UPT), Tulancingo, Hidalgo, MexicoCoordinación de Ciencias Computacionales, Instituto Nacional de Astrofísica, Óptica y Electrónica (INAOE), Puebla, MexicoDivisión de Investigación y Posgrado, Universidad Politécnica de Tulancingo (UPT), Tulancingo, Hidalgo, MexicoDepartamento de Mecatrónica, Universidad Politécnica de Pachuca (UPP), Zempoala, Hidalgo, MexicoMisogyny is a serious social problem that affects the mental and physical health of women and can even lead to femicide. This problem is visible and prevalent in different communication channels, such as music and social networks, encouraging and reinforcing this harmful behavior. Given this situation, the automatic detection of misogynistic content on social networks is a task of increasing interest. In this regard, most current computational approaches employ a supervised machine learning strategy. The main challenge is to capture the diversity and complexity of offensive language directed at women. Accordingly, the size and quality of training data play a fundamental role in the results of the methods. In this paper, we propose a novel data augmentation approach that takes advantage of song lyrics to increase the generalization capability of methods and improve their performance. Hence, we present a methodology for automatically compiling a corpus of song phrases that show abusive and explicit words against women. The proposed approach was evaluated using English and Spanish benchmark datasets, obtaining results that outperform conventional transfer learning techniques and achieve high competitiveness compared with state-of-the-art methods.https://ieeexplore.ieee.org/document/10040492/Data augmentationmisogyny detectiontransfer learningsocial mediasong lyrics |
spellingShingle | Ricardo Calderon-Suarez Rosa M. Ortega-Mendoza Manuel Montes-Y-Gomez Carina Toxqui-Quitl Marco A. Marquez-Vera Enhancing the Detection of Misogynistic Content in Social Media by Transferring Knowledge From Song Phrases IEEE Access Data augmentation misogyny detection transfer learning social media song lyrics |
title | Enhancing the Detection of Misogynistic Content in Social Media by Transferring Knowledge From Song Phrases |
title_full | Enhancing the Detection of Misogynistic Content in Social Media by Transferring Knowledge From Song Phrases |
title_fullStr | Enhancing the Detection of Misogynistic Content in Social Media by Transferring Knowledge From Song Phrases |
title_full_unstemmed | Enhancing the Detection of Misogynistic Content in Social Media by Transferring Knowledge From Song Phrases |
title_short | Enhancing the Detection of Misogynistic Content in Social Media by Transferring Knowledge From Song Phrases |
title_sort | enhancing the detection of misogynistic content in social media by transferring knowledge from song phrases |
topic | Data augmentation misogyny detection transfer learning social media song lyrics |
url | https://ieeexplore.ieee.org/document/10040492/ |
work_keys_str_mv | AT ricardocalderonsuarez enhancingthedetectionofmisogynisticcontentinsocialmediabytransferringknowledgefromsongphrases AT rosamortegamendoza enhancingthedetectionofmisogynisticcontentinsocialmediabytransferringknowledgefromsongphrases AT manuelmontesygomez enhancingthedetectionofmisogynisticcontentinsocialmediabytransferringknowledgefromsongphrases AT carinatoxquiquitl enhancingthedetectionofmisogynisticcontentinsocialmediabytransferringknowledgefromsongphrases AT marcoamarquezvera enhancingthedetectionofmisogynisticcontentinsocialmediabytransferringknowledgefromsongphrases |