Feature-enriched author ranking in incomplete networks
Abstract Evaluating scientists based on their scientific production is a controversial topic. Nevertheless, bibliometrics and algorithmic approaches can assist traditional peer review in numerous tasks, such as attributing research grants, deciding scientific committees, or choosing faculty promotio...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2019-09-01
|
Series: | Applied Network Science |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1007/s41109-019-0182-8 |
_version_ | 1828768670307844096 |
---|---|
author | Jorge Silva David Aparício Fernando Silva |
author_facet | Jorge Silva David Aparício Fernando Silva |
author_sort | Jorge Silva |
collection | DOAJ |
description | Abstract Evaluating scientists based on their scientific production is a controversial topic. Nevertheless, bibliometrics and algorithmic approaches can assist traditional peer review in numerous tasks, such as attributing research grants, deciding scientific committees, or choosing faculty promotions. Traditional bibliometrics rank individual entities (e.g., researchers, journals, faculties) without looking at the whole data (i.e., the whole network). Network algorithms, such as PageRank, have been used to measure node importance in a network, and have been applied to author ranking. However, traditional PageRank only uses network topology and ignores relevant features of scientific collaborations. Multiple extensions of PageRank have been proposed, more suited for author ranking. These methods enrich the network with information about the author’s productivity or the venue and year of the publication/citation. Most state-of-the-art (STOA) feature-enriched methods either ignore or do not combine effectively this information. Furthermore, STOA algorithms typically disregard that the full network is not known for most real-world cases.Here we describe OTARIOS, an author ranking method recently developed by us, which combines multiple publication/citation criteria (i.e., features) to evaluate authors. OTARIOS divides the original network into two subnetworks, insiders and outsiders, which is an adequate representation of citation networks with missing information. We evaluate OTARIOS on a set of five real networks, each with publications in distinct areas of Computer Science, and compare it against STOA methods. When matching OTARIOS’ produced ranking with a ground-truth ranking (comprised of best paper award nominations), we observe that OTARIOS is >30% more accurate than traditional PageRank (i.e., topology based method) and >20% more accurate than STOA (i.e., competing feature enriched methods). We obtain the best results when OTARIOS considers (i) the author’s publication volume and publication recency, (ii) how recently the author’s work is being cited by outsiders, and (iii) how recently the author’s work is being cited by insiders and how individual he is. Our results showcase (a) the importance of efficiently combining relevant features and (b) how to adequately perform author ranking in incomplete networks. |
first_indexed | 2024-12-11T13:34:01Z |
format | Article |
id | doaj.art-d9d4d308690841edae8b105def23e6b2 |
institution | Directory Open Access Journal |
issn | 2364-8228 |
language | English |
last_indexed | 2024-12-11T13:34:01Z |
publishDate | 2019-09-01 |
publisher | SpringerOpen |
record_format | Article |
series | Applied Network Science |
spelling | doaj.art-d9d4d308690841edae8b105def23e6b22022-12-22T01:05:08ZengSpringerOpenApplied Network Science2364-82282019-09-014111510.1007/s41109-019-0182-8Feature-enriched author ranking in incomplete networksJorge Silva0David Aparício1Fernando Silva2CRACS/INESC TEC & Department of Computer Science, Faculty of Sciences, University of Porto (DCC-FCUP)CRACS/INESC TEC & Department of Computer Science, Faculty of Sciences, University of Porto (DCC-FCUP)CRACS/INESC TEC & Department of Computer Science, Faculty of Sciences, University of Porto (DCC-FCUP)Abstract Evaluating scientists based on their scientific production is a controversial topic. Nevertheless, bibliometrics and algorithmic approaches can assist traditional peer review in numerous tasks, such as attributing research grants, deciding scientific committees, or choosing faculty promotions. Traditional bibliometrics rank individual entities (e.g., researchers, journals, faculties) without looking at the whole data (i.e., the whole network). Network algorithms, such as PageRank, have been used to measure node importance in a network, and have been applied to author ranking. However, traditional PageRank only uses network topology and ignores relevant features of scientific collaborations. Multiple extensions of PageRank have been proposed, more suited for author ranking. These methods enrich the network with information about the author’s productivity or the venue and year of the publication/citation. Most state-of-the-art (STOA) feature-enriched methods either ignore or do not combine effectively this information. Furthermore, STOA algorithms typically disregard that the full network is not known for most real-world cases.Here we describe OTARIOS, an author ranking method recently developed by us, which combines multiple publication/citation criteria (i.e., features) to evaluate authors. OTARIOS divides the original network into two subnetworks, insiders and outsiders, which is an adequate representation of citation networks with missing information. We evaluate OTARIOS on a set of five real networks, each with publications in distinct areas of Computer Science, and compare it against STOA methods. When matching OTARIOS’ produced ranking with a ground-truth ranking (comprised of best paper award nominations), we observe that OTARIOS is >30% more accurate than traditional PageRank (i.e., topology based method) and >20% more accurate than STOA (i.e., competing feature enriched methods). We obtain the best results when OTARIOS considers (i) the author’s publication volume and publication recency, (ii) how recently the author’s work is being cited by outsiders, and (iii) how recently the author’s work is being cited by insiders and how individual he is. Our results showcase (a) the importance of efficiently combining relevant features and (b) how to adequately perform author ranking in incomplete networks.http://link.springer.com/article/10.1007/s41109-019-0182-8Node rankingCitation networksIncomplete networks |
spellingShingle | Jorge Silva David Aparício Fernando Silva Feature-enriched author ranking in incomplete networks Applied Network Science Node ranking Citation networks Incomplete networks |
title | Feature-enriched author ranking in incomplete networks |
title_full | Feature-enriched author ranking in incomplete networks |
title_fullStr | Feature-enriched author ranking in incomplete networks |
title_full_unstemmed | Feature-enriched author ranking in incomplete networks |
title_short | Feature-enriched author ranking in incomplete networks |
title_sort | feature enriched author ranking in incomplete networks |
topic | Node ranking Citation networks Incomplete networks |
url | http://link.springer.com/article/10.1007/s41109-019-0182-8 |
work_keys_str_mv | AT jorgesilva featureenrichedauthorrankinginincompletenetworks AT davidaparicio featureenrichedauthorrankinginincompletenetworks AT fernandosilva featureenrichedauthorrankinginincompletenetworks |