An Effective Scholarly Search by Combining Inverted Indices and Structured Search With Citation Networks Analysis


Bibliographic Details
Main Authors: Shah Khalid, Shengli Wu, Abdul Wahid, Aftab Alam, Irfan Ullah
Format: Article
Language: English
Published: IEEE 2021-01-01
Series: IEEE Access
Subjects: Academic search; knowledge graph; inverted index; structure search; citation networks analysis
Online Access: https://ieeexplore.ieee.org/document/9522111/
author Shah Khalid
Shengli Wu
Abdul Wahid
Aftab Alam
Irfan Ullah
collection DOAJ
description The rapid growth in the number of scholarly documents on the Web and on other digital platforms makes it challenging for researchers to find the publications most relevant to their information needs. Major scholarly retrieval systems, such as Google Scholar, Semantic Scholar, PubMed, and CiteSeerX, have mitigated this challenge to a great extent, largely because of advances in ranking approaches. However, existing studies indicate that these methods are still far from their effectiveness ceiling, leaving ample room for improvement in meeting the scholarly needs of users. Existing methods adopt different approaches: some use classical Information Retrieval (IR), while others use semantics-aware methods, including Knowledge Graphs (KGs), to support scholarly search. We hypothesize that combining the best of both worlds can further improve search relevance. To this end, this work uses an inverted index from classical IR, with BM25 as the weighting scheme, combined with Citation Networks Analysis (CNA) to produce the baseline search results. These results are then re-ranked by passing selected entities from the top-k initial results as the search query to the KG. In this way, both the textual content and the structural semantics of research publications are exploited in the retrieval process. The goal is to exploit IR and KG-based retrieval techniques to gain insight into the behavior of both textual and structured information in the ranking of scholarly articles. The proposed solution was evaluated on the ACL Anthology Network (AAN) dataset. The results show that the proposed technique can improve retrieval performance, as measured by Normalized Discounted Cumulative Gain (nDCG) and precision.
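As an illustration of two building blocks named in the description, the following is a minimal Python sketch of BM25 scoring over an inverted index and of nDCG@k. It is not the authors' implementation: the whitespace tokenizer, the parameter values k1=1.2 and b=0.75, the Lucene-style IDF variant, and the three toy documents are all assumptions made for this example, and the CNA and KG re-ranking stages are not covered.

```python
import math
from collections import Counter, defaultdict


def build_index(docs):
    """Build an inverted index: term -> {doc_id: term frequency}."""
    index = defaultdict(dict)
    lengths = {}
    for doc_id, text in docs.items():
        tokens = text.lower().split()  # assumed tokenizer: whitespace only
        lengths[doc_id] = len(tokens)
        for term, tf in Counter(tokens).items():
            index[term][doc_id] = tf
    return index, lengths


def bm25_search(query, index, lengths, k1=1.2, b=0.75):
    """Score documents with BM25 by walking only the query terms' postings."""
    n_docs = len(lengths)
    avg_len = sum(lengths.values()) / n_docs
    scores = defaultdict(float)
    for term in query.lower().split():
        postings = index.get(term, {})
        if not postings:
            continue  # term does not occur in the collection
        df = len(postings)
        # Non-negative IDF variant (assumed; other BM25 variants exist).
        idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
        for doc_id, tf in postings.items():
            norm = tf + k1 * (1 - b + b * lengths[doc_id] / avg_len)
            scores[doc_id] += idf * tf * (k1 + 1) / norm
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


def ndcg_at_k(ranked_ids, relevance, k):
    """nDCG@k with graded relevance and a log2 position discount."""
    def dcg(gains):
        return sum(g / math.log2(i + 2) for i, g in enumerate(gains))

    gains = [relevance.get(doc_id, 0) for doc_id in ranked_ids[:k]]
    ideal = sorted(relevance.values(), reverse=True)[:k]
    ideal_dcg = dcg(ideal)
    return dcg(gains) / ideal_dcg if ideal_dcg > 0 else 0.0


# Hypothetical toy collection and relevance judgments, for illustration only.
docs = {
    "p1": "statistical machine translation with phrase alignment",
    "p2": "parsing with neural networks",
    "p3": "neural machine translation",
}
index, lengths = build_index(docs)
results = bm25_search("machine translation", index, lengths)
ranked_ids = [doc_id for doc_id, _ in results]
relevance = {"p1": 2, "p3": 1}
score = ndcg_at_k(ranked_ids, relevance, k=2)
```

In this toy run only the postings lists for the two query terms are visited, which is the point of the inverted index. In the pipeline the description outlines, a ranking like `ranked_ids` would be the baseline that later stages re-rank.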
first_indexed 2024-12-22T05:49:54Z
format Article
id doaj.art-81745f26257a40758c54a667ac01a873
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-22T05:49:54Z
publishDate 2021-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-81745f26257a40758c54a667ac01a873 (2022-12-21T18:36:54Z); eng; IEEE; IEEE Access; ISSN 2169-3536; 2021-01-01; vol. 9, pp. 120210-120226; DOI 10.1109/ACCESS.2021.3107939; document 9522111
Shah Khalid (0), https://orcid.org/0000-0001-5735-5863
Shengli Wu (1)
Abdul Wahid (2), https://orcid.org/0000-0002-8585-486X
Aftab Alam (3), https://orcid.org/0000-0001-9222-2468
Irfan Ullah (4), https://orcid.org/0000-0003-0693-5467
Affiliations, in the order listed in the record:
School of Computer Science and Communication Engineering, Jiangsu University, Zhenjiang, China
School of Computer Science and Communication Engineering, Jiangsu University, Zhenjiang, China
School of Electrical Engineering and Computer Science, National University of Science and Technology, Islamabad, Pakistan
College of Science and Engineering, Hamad Bin Khalifa University, Ar Rayyan, Qatar
Department of Computer Science, Shaheed Benazir Bhutto University, Sheringal, Pakistan
title An Effective Scholarly Search by Combining Inverted Indices and Structured Search With Citation Networks Analysis
topic Academic search
knowledge graph
inverted index
structure search
citation networks analysis
url https://ieeexplore.ieee.org/document/9522111/