Comparative Analysis of TF–IDF and Hashing Vectorizer for Fake News Detection in Sindhi: A Machine Learning and Deep Learning Approach
Social media has become a popular platform for accessing and sharing news, but it has also led to a rise in fake news, posing serious risks. The ease of dissemination and constant flow of information raise concerns about the spread of incorrect information. Timely verification of news is crucial to...
Main Authors: | Rubab Roshan, Irfan Ali Bhacho, Sammer Zai |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG 2023-09-01 |
Series: | Engineering Proceedings |
Subjects: | machine learning; deep learning; BERT; text mining; TF–IDF; hashing vectorizer |
Online Access: | https://www.mdpi.com/2673-4591/46/1/5 |
_version_ | 1797381168635576320 |
---|---|
author | Rubab Roshan; Irfan Ali Bhacho; Sammer Zai |
author_facet | Rubab Roshan; Irfan Ali Bhacho; Sammer Zai |
author_sort | Rubab Roshan |
collection | DOAJ |
description | Social media has become a popular platform for accessing and sharing news, but it has also led to a rise in fake news, posing serious risks. The ease of dissemination and constant flow of information raise concerns about the spread of incorrect information. Timely verification of news is crucial to combat false news. However, most research on false news identification has focused on English, neglecting South Asian languages. This study examines a dataset of Sindhi tweets, employing text feature extraction techniques such as TF–IDF and hashing vectorizer. Several machine learning algorithms, along with advanced deep learning models such as Transformer BERT, were utilized for analysis. |
first_indexed | 2024-03-08T20:47:29Z |
format | Article |
id | doaj.art-1b8f355d98fd43dbb6b6ff687320f253 |
institution | Directory of Open Access Journals |
issn | 2673-4591 |
language | English |
last_indexed | 2024-03-08T20:47:29Z |
publishDate | 2023-09-01 |
publisher | MDPI AG |
record_format | Article |
series | Engineering Proceedings |
spelling | doaj.art-1b8f355d98fd43dbb6b6ff687320f253; 2023-12-22T14:07:01Z; eng; MDPI AG; Engineering Proceedings; 2673-4591; 2023-09-01; 46; 1; 5; 10.3390/engproc2023046005; Comparative Analysis of TF–IDF and Hashing Vectorizer for Fake News Detection in Sindhi: A Machine Learning and Deep Learning Approach; Rubab Roshan [0]; Irfan Ali Bhacho [1]; Sammer Zai [2]; [0] Department of Computer Systems Engineering, Mehran University of Engineering and Technology (MUET), Jamshoro 76062, Pakistan; [1] Department of Computer Systems Engineering, Mehran University of Engineering and Technology (MUET), Jamshoro 76062, Pakistan; [2] Department of Computer Systems Engineering, Mehran University of Engineering and Technology (MUET), Jamshoro 76062, Pakistan; Social media has become a popular platform for accessing and sharing news, but it has also led to a rise in fake news, posing serious risks. The ease of dissemination and constant flow of information raise concerns about the spread of incorrect information. Timely verification of news is crucial to combat false news. However, most research on false news identification has focused on English, neglecting South Asian languages. This study examines a dataset of Sindhi tweets, employing text feature extraction techniques such as TF–IDF and hashing vectorizer. Several machine learning algorithms, along with advanced deep learning models such as Transformer BERT, were utilized for analysis.; https://www.mdpi.com/2673-4591/46/1/5; machine learning; deep learning; BERT; text mining; TF–IDF; hashing vectorizer |
spellingShingle | Rubab Roshan; Irfan Ali Bhacho; Sammer Zai; Comparative Analysis of TF–IDF and Hashing Vectorizer for Fake News Detection in Sindhi: A Machine Learning and Deep Learning Approach; Engineering Proceedings; machine learning; deep learning; BERT; text mining; TF–IDF; hashing vectorizer |
title | Comparative Analysis of TF–IDF and Hashing Vectorizer for Fake News Detection in Sindhi: A Machine Learning and Deep Learning Approach |
title_sort | comparative analysis of tf idf and hashing vectorizer for fake news detection in sindhi a machine learning and deep learning approach |
topic | machine learning; deep learning; BERT; text mining; TF–IDF; hashing vectorizer |
url | https://www.mdpi.com/2673-4591/46/1/5 |
work_keys_str_mv | AT rubabroshan comparativeanalysisoftfidfandhashingvectorizerforfakenewsdetectioninsindhiamachinelearninganddeeplearningapproach AT irfanalibhacho comparativeanalysisoftfidfandhashingvectorizerforfakenewsdetectioninsindhiamachinelearninganddeeplearningapproach AT sammerzai comparativeanalysisoftfidfandhashingvectorizerforfakenewsdetectioninsindhiamachinelearninganddeeplearningapproach |