Feature-enriched author ranking in incomplete networks

Abstract Evaluating scientists based on their scientific production is a controversial topic. Nevertheless, bibliometrics and algorithmic approaches can assist traditional peer review in numerous tasks, such as attributing research grants, deciding scientific committees, or choosing faculty promotio...

Full description

Bibliographic Details
Main Authors: Jorge Silva, David Aparício, Fernando Silva
Format: Article
Language:English
Published: SpringerOpen 2019-09-01
Series:Applied Network Science
Subjects:
Online Access:http://link.springer.com/article/10.1007/s41109-019-0182-8
_version_ 1828768670307844096
author Jorge Silva
David Aparício
Fernando Silva
author_facet Jorge Silva
David Aparício
Fernando Silva
author_sort Jorge Silva
collection DOAJ
description Abstract Evaluating scientists based on their scientific production is a controversial topic. Nevertheless, bibliometrics and algorithmic approaches can assist traditional peer review in numerous tasks, such as attributing research grants, deciding scientific committees, or choosing faculty promotions. Traditional bibliometrics rank individual entities (e.g., researchers, journals, faculties) without looking at the whole data (i.e., the whole network). Network algorithms, such as PageRank, have been used to measure node importance in a network, and have been applied to author ranking. However, traditional PageRank only uses network topology and ignores relevant features of scientific collaborations. Multiple extensions of PageRank have been proposed, more suited for author ranking. These methods enrich the network with information about the author’s productivity or the venue and year of the publication/citation. Most state-of-the-art (STOA) feature-enriched methods either ignore or do not combine effectively this information. Furthermore, STOA algorithms typically disregard that the full network is not known for most real-world cases.Here we describe OTARIOS, an author ranking method recently developed by us, which combines multiple publication/citation criteria (i.e., features) to evaluate authors. OTARIOS divides the original network into two subnetworks, insiders and outsiders, which is an adequate representation of citation networks with missing information. We evaluate OTARIOS on a set of five real networks, each with publications in distinct areas of Computer Science, and compare it against STOA methods. When matching OTARIOS’ produced ranking with a ground-truth ranking (comprised of best paper award nominations), we observe that OTARIOS is >30% more accurate than traditional PageRank (i.e., topology based method) and >20% more accurate than STOA (i.e., competing feature enriched methods). We obtain the best results when OTARIOS considers (i) the author’s publication volume and publication recency, (ii) how recently the author’s work is being cited by outsiders, and (iii) how recently the author’s work is being cited by insiders and how individual he is. Our results showcase (a) the importance of efficiently combining relevant features and (b) how to adequately perform author ranking in incomplete networks.
first_indexed 2024-12-11T13:34:01Z
format Article
id doaj.art-d9d4d308690841edae8b105def23e6b2
institution Directory Open Access Journal
issn 2364-8228
language English
last_indexed 2024-12-11T13:34:01Z
publishDate 2019-09-01
publisher SpringerOpen
record_format Article
series Applied Network Science
spelling doaj.art-d9d4d308690841edae8b105def23e6b22022-12-22T01:05:08ZengSpringerOpenApplied Network Science2364-82282019-09-014111510.1007/s41109-019-0182-8Feature-enriched author ranking in incomplete networksJorge Silva0David Aparício1Fernando Silva2CRACS/INESC TEC & Department of Computer Science, Faculty of Sciences, University of Porto (DCC-FCUP)CRACS/INESC TEC & Department of Computer Science, Faculty of Sciences, University of Porto (DCC-FCUP)CRACS/INESC TEC & Department of Computer Science, Faculty of Sciences, University of Porto (DCC-FCUP)Abstract Evaluating scientists based on their scientific production is a controversial topic. Nevertheless, bibliometrics and algorithmic approaches can assist traditional peer review in numerous tasks, such as attributing research grants, deciding scientific committees, or choosing faculty promotions. Traditional bibliometrics rank individual entities (e.g., researchers, journals, faculties) without looking at the whole data (i.e., the whole network). Network algorithms, such as PageRank, have been used to measure node importance in a network, and have been applied to author ranking. However, traditional PageRank only uses network topology and ignores relevant features of scientific collaborations. Multiple extensions of PageRank have been proposed, more suited for author ranking. These methods enrich the network with information about the author’s productivity or the venue and year of the publication/citation. Most state-of-the-art (STOA) feature-enriched methods either ignore or do not combine effectively this information. Furthermore, STOA algorithms typically disregard that the full network is not known for most real-world cases.Here we describe OTARIOS, an author ranking method recently developed by us, which combines multiple publication/citation criteria (i.e., features) to evaluate authors. OTARIOS divides the original network into two subnetworks, insiders and outsiders, which is an adequate representation of citation networks with missing information. We evaluate OTARIOS on a set of five real networks, each with publications in distinct areas of Computer Science, and compare it against STOA methods. When matching OTARIOS’ produced ranking with a ground-truth ranking (comprised of best paper award nominations), we observe that OTARIOS is >30% more accurate than traditional PageRank (i.e., topology based method) and >20% more accurate than STOA (i.e., competing feature enriched methods). We obtain the best results when OTARIOS considers (i) the author’s publication volume and publication recency, (ii) how recently the author’s work is being cited by outsiders, and (iii) how recently the author’s work is being cited by insiders and how individual he is. Our results showcase (a) the importance of efficiently combining relevant features and (b) how to adequately perform author ranking in incomplete networks.http://link.springer.com/article/10.1007/s41109-019-0182-8Node rankingCitation networksIncomplete networks
spellingShingle Jorge Silva
David Aparício
Fernando Silva
Feature-enriched author ranking in incomplete networks
Applied Network Science
Node ranking
Citation networks
Incomplete networks
title Feature-enriched author ranking in incomplete networks
title_full Feature-enriched author ranking in incomplete networks
title_fullStr Feature-enriched author ranking in incomplete networks
title_full_unstemmed Feature-enriched author ranking in incomplete networks
title_short Feature-enriched author ranking in incomplete networks
title_sort feature enriched author ranking in incomplete networks
topic Node ranking
Citation networks
Incomplete networks
url http://link.springer.com/article/10.1007/s41109-019-0182-8
work_keys_str_mv AT jorgesilva featureenrichedauthorrankinginincompletenetworks
AT davidaparicio featureenrichedauthorrankinginincompletenetworks
AT fernandosilva featureenrichedauthorrankinginincompletenetworks