Citation analysis on Google Scholar

Google Scholar, Scopus and Web of Science are some of the most commonly used online databases for scholarly work. These databases vary in their coverage and in the accuracy of their citation counts. The purpose of this report is to conduct citation analysis on Google Scholar. H-index, among other statis...

Bibliographic Details
Main Author: Vidur Puliani
Other Authors: Xiao Xiao Kui
Format: Final Year Project (FYP)
Language: English
Published: 2015
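Subjects: DRNTU::Engineering::Computer science and engineering::Mathematics of computing::Numerical analysis
Online Access: http://hdl.handle.net/10356/62821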
author Vidur Puliani
author2 Xiao Xiao Kui
author_facet Xiao Xiao Kui
Vidur Puliani
author_sort Vidur Puliani
collection NTU
description Google Scholar, Scopus and Web of Science are some of the most commonly used online databases for scholarly work. These databases vary in their coverage and in the accuracy of their citation counts. The purpose of this report is to conduct citation analysis on Google Scholar. The h-index, among other statistics, is a widely used measure of an author’s citation record. It is often used to evaluate the impact of an author’s work on his or her peers and as an evaluation tool for grants and promotions. Given the importance of the h-index as a measure, it is important to identify and remove any possible distortions so that it gives an unbiased measure of an author’s influence. The total citation counts used to calculate the h-index include self-citations. Self-citations are citations where the author of the citing paper and the cited paper are the same. A higher number of self-citations might correlate with a higher h-index, which does not necessarily imply a greater influence of an author’s work. Therefore, this report aims to analyse the effect of self-citation on the h-index by calculating two h-index values, one using the total number of citations and the other excluding self-citations. A Python crawler was developed to collect the citation data for three authors from Google Scholar and store it in a local database for analysis. The citation analysis shows that the h-index decreases when self-citations are excluded, although the effect was limited and non-uniform.
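The two h-index values mentioned in the description can be illustrated with a minimal Python sketch. It is not the report's crawler or analysis code, which this record does not reproduce. It uses the standard definition of the h-index (the largest h such that h papers have at least h citations each) and assumes, for illustration only, that a citation counts as a self-citation when the profiled author appears among the citing paper's authors. The function names, the data layout and the sample data are all hypothetical.

```python
from typing import Iterable, List, Set, Tuple


def h_index(citation_counts: Iterable[int]) -> int:
    """Return the largest h such that h papers have at least h citations each."""
    ranked = sorted(citation_counts, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


def h_indices(author: str, papers: List[List[Set[str]]]) -> Tuple[int, int]:
    """Return (h-index with self-citations, h-index without self-citations).

    Each paper is a list of citing papers; each citing paper is the set of
    its author names. Assumption: a citation is a self-citation when
    `author` is one of the citing paper's authors.
    """
    total = [len(citing) for citing in papers]
    external = [
        sum(1 for citing_authors in citing if author not in citing_authors)
        for citing in papers
    ]
    return h_index(total), h_index(external)


# Made-up sample data for a hypothetical author "A" with four papers.
SAMPLE = [
    [{"A"}, {"B"}, {"C"}],       # 3 citations, 1 of them a self-citation
    [{"A", "B"}, {"A"}, {"D"}],  # 3 citations, 2 self-citations
    [{"A"}, {"A"}, {"E"}],       # 3 citations, 2 self-citations
    [],                          # uncited paper
]

if __name__ == "__main__":
    with_self, without_self = h_indices("A", SAMPLE)
    print(with_self, without_self)
```

Matching authors by exact name string is the weakest part of this sketch: name variants and shared names on real Google Scholar data would need disambiguation before the two values could be compared reliably.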
first_indexed 2024-10-01T03:01:54Z
format Final Year Project (FYP)
id ntu-10356/62821
institution Nanyang Technological University
language English
last_indexed 2024-10-01T03:01:54Z
publishDate 2015
record_format dspace
spelling ntu-10356/62821 2023-03-03T20:26:51Z Citation analysis on Google Scholar Vidur Puliani Xiao Xiao Kui School of Computer Engineering DRNTU::Engineering::Computer science and engineering::Mathematics of computing::Numerical analysis Bachelor of Engineering (Computer Science) 2015-04-29T07:40:13Z 2015-04-29T07:40:13Z 2015 2015 Final Year Project (FYP) http://hdl.handle.net/10356/62821 en Nanyang Technological University 37 p. application/pdf
title Citation analysis on Google Scholar
title_sort citation analysis on google scholar
topic DRNTU::Engineering::Computer science and engineering::Mathematics of computing::Numerical analysis
url http://hdl.handle.net/10356/62821