Detecting Pseudo-Manipulated Citations in Scientific Literature through Perturbations of the Citation Graph

Ensuring the integrity of scientific literature is essential for advancing knowledge and research. However, the credibility and trustworthiness of scholarly publications are compromised by manipulated citations. Traditional methods, such as manual inspection and basic statistical analyses, have limi...

Full description

Bibliographic Details
Main Authors: Renata Avros, Saar Keshet, Dvora Toledano Kitai, Evgeny Vexler, Zeev Volkovich
Format: Article
Language:English
Published: MDPI AG 2023-09-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/11/18/3820
Description
Summary:Ensuring the integrity of scientific literature is essential for advancing knowledge and research. However, the credibility and trustworthiness of scholarly publications are compromised by manipulated citations. Traditional methods, such as manual inspection and basic statistical analyses, have limitations in detecting intricate patterns and subtle manipulations of citations. In recent years, network-based approaches have emerged as promising techniques for identifying and understanding citation manipulation. This study introduces a novel method to identify potential citation manipulation in academic papers using perturbations of a deep embedding model. The key idea is to reconstruct meaningful connections represented by citations within a network by exploring, to some extent, longer alternative paths. These indirect pathways enable the recovery of reliable citations while estimating their trustworthiness. The investigation takes a comprehensive approach to link prediction, leveraging the consistent behavior of prominent connections when exposed to network perturbations. Through numerical experiments, the method demonstrates a high capability to identify reliable citations as the core of the analyzed data and to raise suspicions about unreliable references that may have been manipulated. This research presents a refined method for tackling the urgent problem of citation manipulation in academic papers. It harnesses statistical sampling and graph-embedding techniques to evaluate the credibility of scholarly publications with a substantial assessment of the whole citation graph.
ISSN:2227-7390