A Scientific Paper Recommendation Framework Based on Multi-Topic Communities and Modified PageRank

Personalized PageRank is a variant of PageRank, widely developed for citation recommendation. However, the personalized PageRank that works with a vast amount and rich scholarly data still results in information overload. Sometimes, junior scholars still need help to arrange queries quickly because...

Full description

Bibliographic Details
Main Authors: Agung Hadhiatma, Azhari Azhari, Yohanes Suyanto
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10056944/
Description
Summary:Personalized PageRank is a variant of PageRank, widely developed for citation recommendation. However, the personalized PageRank that works with a vast amount and rich scholarly data still results in information overload. Sometimes, junior scholars still need help to arrange queries quickly because of limited domain knowledge. Senior researchers need reference papers regarding a similar topic they intend to search for and related topics as a new insight. In this research, scientific citation recommendation aims to find the most influential papers with similar and related topics. Related topic papers in serendipitous perspectives are reference papers that are novel, diversified and unexpected to a user. The unexpectedness of recommended papers can be papers with different topics to queries but still relevant. To accomplish these challenges, we propose a framework of scientific citation recommendation with serendipitous perspectives. The framework includes feature extraction of an academic citation network, selection of multi-topic communities, and ranking papers in the selected multi-topic communities by modified PageRank. Papers in the chosen communities tend to link to similar and related papers. Modified PageRank is an extension of personalized PageRank, which works on multi-topic communities and manuscript queries. The experiments reveal that the proposed models outperform some models of personalized PageRank and some models of Content-Based Filtering. The multi-topic communities-based models work more effectively than the baselines if they run in a large dataset since the topic communities become more cohesive.
ISSN:2169-3536