Reputation-Based Approach Toward Web Content Credibility Analysis

Web content credibility implies finding credible and correct information on the web. Recent studies have shown there is an increasing trend of users turning towards the web for searching information related to a variety of topics including health, stocks, education, politics to name few. Information...

Full description

Bibliographic Details
Main Authors: Saba Mahmood, Anwar Ghani, Ali Daud, Shahaboddin Shamshirband
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8848373/
_version_ 1811212007128432640
author Saba Mahmood
Anwar Ghani
Ali Daud
Shahaboddin Shamshirband
author_facet Saba Mahmood
Anwar Ghani
Ali Daud
Shahaboddin Shamshirband
author_sort Saba Mahmood
collection DOAJ
description Web content credibility implies finding credible and correct information on the web. Recent studies have shown there is an increasing trend of users turning towards the web for searching information related to a variety of topics including health, stocks, education, politics to name few. Information credibility is a critical factor in these domains for the decision makers. There is no limitation on the authorship of those articles and content. One criterion for evaluating credibility is to check the authority or source of information. However, there are situations when wrong information flows from credible sources. There are various approaches towards credibility assessment, broadly categorized into human-based and computational approaches. Computational approaches utilizing machine learning based techniques are computationally expensive. Reputation based approaches overcome this, however the latest work fails to take into account issue of negative referrals and utilizes simple summation as the calculation structure making it more resilient to attacks. This paper put forth verified hypothesis of direct relationship of credibility to the expertise of entity. Authors proposed a Bayesian based approach using feedback in the form of interaction among the entities to compute their expertise level, thereby showing improved results in terms of Precision, Correlation and Mean Average Error. The experiments are performed on two different datasets, one of the dataset is developed from a survey as the part of the research study. The results from the two experiments show that the reputation ranks are independent of the pattern of ratings and density of data, unlike previous techniques whose results were limited by these factors. The proposed technique gives 27% and 18% more precise results for the two experiments respectively compared to the baseline. The correlation results are also significant in both experiments for the proposed technique with significant values of 0.39 and 0.87 showing a linear relationship between predicted and original data. The paper also discusses the reputation attacks and proposes counter measures to tackle these attacks through simulation results.
first_indexed 2024-04-12T05:21:27Z
format Article
id doaj.art-99a09f50ed274fb8974d24f8a41c0763
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-12T05:21:27Z
publishDate 2019-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-99a09f50ed274fb8974d24f8a41c07632022-12-22T03:46:26ZengIEEEIEEE Access2169-35362019-01-01713995713996910.1109/ACCESS.2019.29437478848373Reputation-Based Approach Toward Web Content Credibility AnalysisSaba Mahmood0Anwar Ghani1https://orcid.org/0000-0001-7474-0405Ali Daud2https://orcid.org/0000-0002-6605-498XShahaboddin Shamshirband3Department of Computer Science and Software Engineering, International Islamic University Islamabad, Islamabad, PakistanDepartment of Computer Science and Software Engineering, International Islamic University Islamabad, Islamabad, PakistanDepartment of Computer Science and Artificial Intelligence, College of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi ArabiaDepartment for Management of Science and Technology Development, Ton Duc Thang University, Ho Chi Minh City, VietnamWeb content credibility implies finding credible and correct information on the web. Recent studies have shown there is an increasing trend of users turning towards the web for searching information related to a variety of topics including health, stocks, education, politics to name few. Information credibility is a critical factor in these domains for the decision makers. There is no limitation on the authorship of those articles and content. One criterion for evaluating credibility is to check the authority or source of information. However, there are situations when wrong information flows from credible sources. There are various approaches towards credibility assessment, broadly categorized into human-based and computational approaches. Computational approaches utilizing machine learning based techniques are computationally expensive. Reputation based approaches overcome this, however the latest work fails to take into account issue of negative referrals and utilizes simple summation as the calculation structure making it more resilient to attacks. This paper put forth verified hypothesis of direct relationship of credibility to the expertise of entity. Authors proposed a Bayesian based approach using feedback in the form of interaction among the entities to compute their expertise level, thereby showing improved results in terms of Precision, Correlation and Mean Average Error. The experiments are performed on two different datasets, one of the dataset is developed from a survey as the part of the research study. The results from the two experiments show that the reputation ranks are independent of the pattern of ratings and density of data, unlike previous techniques whose results were limited by these factors. The proposed technique gives 27% and 18% more precise results for the two experiments respectively compared to the baseline. The correlation results are also significant in both experiments for the proposed technique with significant values of 0.39 and 0.87 showing a linear relationship between predicted and original data. The paper also discusses the reputation attacks and proposes counter measures to tackle these attacks through simulation results.https://ieeexplore.ieee.org/document/8848373/Web content credibilityinformation rankingreputation systemsexperts ranking
spellingShingle Saba Mahmood
Anwar Ghani
Ali Daud
Shahaboddin Shamshirband
Reputation-Based Approach Toward Web Content Credibility Analysis
IEEE Access
Web content credibility
information ranking
reputation systems
experts ranking
title Reputation-Based Approach Toward Web Content Credibility Analysis
title_full Reputation-Based Approach Toward Web Content Credibility Analysis
title_fullStr Reputation-Based Approach Toward Web Content Credibility Analysis
title_full_unstemmed Reputation-Based Approach Toward Web Content Credibility Analysis
title_short Reputation-Based Approach Toward Web Content Credibility Analysis
title_sort reputation based approach toward web content credibility analysis
topic Web content credibility
information ranking
reputation systems
experts ranking
url https://ieeexplore.ieee.org/document/8848373/
work_keys_str_mv AT sabamahmood reputationbasedapproachtowardwebcontentcredibilityanalysis
AT anwarghani reputationbasedapproachtowardwebcontentcredibilityanalysis
AT alidaud reputationbasedapproachtowardwebcontentcredibilityanalysis
AT shahaboddinshamshirband reputationbasedapproachtowardwebcontentcredibilityanalysis