Arabic Rumor Detection Using Contextual Deep Bidirectional Language Modeling

In today’s world, news outlets have changed dramatically; newspapers are obsolete, and radio is no longer in the picture. People look for news online and on social media, such as Twitter and Facebook. Social media contributors share information and trending stories before verifying their...

Full description

Bibliographic Details
Main Authors: Naelah O. Bahurmuz, Ghada A. Amoudi, Fatmah A. Baothman, Amani T. Jamal, Hanan S. Alghamdi, Areej M. Alhothali
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9931021/
_version_ 1811233078840918016
author Naelah O. Bahurmuz
Ghada A. Amoudi
Fatmah A. Baothman
Amani T. Jamal
Hanan S. Alghamdi
Areej M. Alhothali
author_facet Naelah O. Bahurmuz
Ghada A. Amoudi
Fatmah A. Baothman
Amani T. Jamal
Hanan S. Alghamdi
Areej M. Alhothali
author_sort Naelah O. Bahurmuz
collection DOAJ
description In today’s world, news outlets have changed dramatically; newspapers are obsolete, and radio is no longer in the picture. People look for news online and on social media, such as Twitter and Facebook. Social media contributors share information and trending stories before verifying their truthfulness, thus, spreading rumors. Early identification of rumors from social media has attracted many researchers. However, a relatively smaller number of studies focused on other languages, such as Arabic. In this study, an Arabic rumor detection model is proposed. The model was built using transformer-based deep learning architecture. According to the literature, transformers are neural networks with outstanding performance in natural language processing tasks. Two transformers-based models, AraBERT and MARBERT, were employed, tested, and evaluated using three recently developed Arabic datasets. These models are extensions to the BERT, Bidirectional Encoder Representations from Transformers, a deep learning model that uses transformer architecture to learn the text representations and leverages the attention mechanism. We have also mitigated the challenges introduced by the imbalanced training datasets by employing two sampling techniques. The experimental results of our proposed approaches achieved a maximum accuracy of 0.97. This result demonstrated the effectiveness of the proposed method and outperformed other existing Arabic rumor detection methods.
first_indexed 2024-04-12T11:15:32Z
format Article
id doaj.art-235d8337f98344d2a37a5167e93ae955
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-12T11:15:32Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-235d8337f98344d2a37a5167e93ae9552022-12-22T03:35:31ZengIEEEIEEE Access2169-35362022-01-011011490711491810.1109/ACCESS.2022.32175229931021Arabic Rumor Detection Using Contextual Deep Bidirectional Language ModelingNaelah O. Bahurmuz0Ghada A. Amoudi1https://orcid.org/0000-0001-5640-7166Fatmah A. Baothman2https://orcid.org/0000-0003-0344-1007Amani T. Jamal3Hanan S. Alghamdi4https://orcid.org/0000-0003-2059-7687Areej M. Alhothali5https://orcid.org/0000-0001-9727-0178Department of Information Systems, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi ArabiaDepartment of Information Systems, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi ArabiaDepartment of Information Systems, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi ArabiaDepartment of Computer Science, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi ArabiaDepartment of Information Systems, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi ArabiaDepartment of Computer Science, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi ArabiaIn today’s world, news outlets have changed dramatically; newspapers are obsolete, and radio is no longer in the picture. People look for news online and on social media, such as Twitter and Facebook. Social media contributors share information and trending stories before verifying their truthfulness, thus, spreading rumors. Early identification of rumors from social media has attracted many researchers. However, a relatively smaller number of studies focused on other languages, such as Arabic. In this study, an Arabic rumor detection model is proposed. The model was built using transformer-based deep learning architecture. According to the literature, transformers are neural networks with outstanding performance in natural language processing tasks. Two transformers-based models, AraBERT and MARBERT, were employed, tested, and evaluated using three recently developed Arabic datasets. These models are extensions to the BERT, Bidirectional Encoder Representations from Transformers, a deep learning model that uses transformer architecture to learn the text representations and leverages the attention mechanism. We have also mitigated the challenges introduced by the imbalanced training datasets by employing two sampling techniques. The experimental results of our proposed approaches achieved a maximum accuracy of 0.97. This result demonstrated the effectiveness of the proposed method and outperformed other existing Arabic rumor detection methods.https://ieeexplore.ieee.org/document/9931021/Classificationdeep learningfake newsimbalanced datamachine learningnatural language processing
spellingShingle Naelah O. Bahurmuz
Ghada A. Amoudi
Fatmah A. Baothman
Amani T. Jamal
Hanan S. Alghamdi
Areej M. Alhothali
Arabic Rumor Detection Using Contextual Deep Bidirectional Language Modeling
IEEE Access
Classification
deep learning
fake news
imbalanced data
machine learning
natural language processing
title Arabic Rumor Detection Using Contextual Deep Bidirectional Language Modeling
title_full Arabic Rumor Detection Using Contextual Deep Bidirectional Language Modeling
title_fullStr Arabic Rumor Detection Using Contextual Deep Bidirectional Language Modeling
title_full_unstemmed Arabic Rumor Detection Using Contextual Deep Bidirectional Language Modeling
title_short Arabic Rumor Detection Using Contextual Deep Bidirectional Language Modeling
title_sort arabic rumor detection using contextual deep bidirectional language modeling
topic Classification
deep learning
fake news
imbalanced data
machine learning
natural language processing
url https://ieeexplore.ieee.org/document/9931021/
work_keys_str_mv AT naelahobahurmuz arabicrumordetectionusingcontextualdeepbidirectionallanguagemodeling
AT ghadaaamoudi arabicrumordetectionusingcontextualdeepbidirectionallanguagemodeling
AT fatmahabaothman arabicrumordetectionusingcontextualdeepbidirectionallanguagemodeling
AT amanitjamal arabicrumordetectionusingcontextualdeepbidirectionallanguagemodeling
AT hanansalghamdi arabicrumordetectionusingcontextualdeepbidirectionallanguagemodeling
AT areejmalhothali arabicrumordetectionusingcontextualdeepbidirectionallanguagemodeling