Evaluation of unique identifiers used for citation linking [version 1; referees: 1 approved, 2 approved with reservations]

Unique identifiers (UID) are seen as an effective tool to create links between identical publications in databases or identify duplicates in a database. The purpose of the present study is to investigate how well UIDs work for citation linking. We have two objectives: Explore the coverage, precision...

Full description

Bibliographic Details
Main Authors: Heidi Holst Madsen, Dicte Madsen, Marianne Gauffriau
Format: Article
Language:English
Published: F1000 Research Ltd 2016-06-01
Series:F1000Research
Subjects:
Online Access:http://f1000research.com/articles/5-1539/v1
_version_ 1818148722664538112
author Heidi Holst Madsen
Dicte Madsen
Marianne Gauffriau
author_facet Heidi Holst Madsen
Dicte Madsen
Marianne Gauffriau
author_sort Heidi Holst Madsen
collection DOAJ
description Unique identifiers (UID) are seen as an effective tool to create links between identical publications in databases or identify duplicates in a database. The purpose of the present study is to investigate how well UIDs work for citation linking. We have two objectives: Explore the coverage, precision, and characteristics of publications matched versus not matched with UIDs as the match key.   Illustrate how publication sets formed by using UIDs as the match key may affect the bibliometric indicators: Number of publications, number of citations and the average number of citations per publication.   The objectives are addressed in a literature review and a case study. The literature review shows that only a few studies evaluate how well UIDs work as a match key. From the literature we identify four error types: Duplicate digital object identifiers (DOI), incorrect DOIs in reference lists and databases, DOIs not registered by the database where a bibliometric analysis is performed, and erroneous optical or special character recognition.   The case study explores the use of UIDs in the integration between the databases Pure and SciVal. Specifically journal publications in English are matched between the two databases. We find all error types except erroneous optical or special character recognition in our publication sets. In particular the duplicate DOIs constitute a problem for the calculation of bibliometric indicators as both keeping the duplicates to improve the reliability of citation counts and deleting them to improve the reliability of publication counts will distort the calculation of average number of citations per publication.   The use of UIDs as a match key in citation linking is implemented in many settings, and the availability of UIDs may become critical for the inclusion of a publication or a database in a bibliometric analysis.
first_indexed 2024-12-11T12:55:40Z
format Article
id doaj.art-3ecb3359cffb450698c3c775c5bcda2f
institution Directory Open Access Journal
issn 2046-1402
language English
last_indexed 2024-12-11T12:55:40Z
publishDate 2016-06-01
publisher F1000 Research Ltd
record_format Article
series F1000Research
spelling doaj.art-3ecb3359cffb450698c3c775c5bcda2f2022-12-22T01:06:35ZengF1000 Research LtdF1000Research2046-14022016-06-01510.12688/f1000research.8913.19591Evaluation of unique identifiers used for citation linking [version 1; referees: 1 approved, 2 approved with reservations]Heidi Holst Madsen0Dicte Madsen1Marianne Gauffriau2Faculty Library of Natural and Health Sciences, Copenhagen University Library, The Royal Library, Copenhagen, DK-2200, DenmarkFaculty Library of Natural and Health Sciences, Copenhagen University Library, The Royal Library, Copenhagen, DK-2200, DenmarkSUND Research & Innovation, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, DK-2200, DenmarkUnique identifiers (UID) are seen as an effective tool to create links between identical publications in databases or identify duplicates in a database. The purpose of the present study is to investigate how well UIDs work for citation linking. We have two objectives: Explore the coverage, precision, and characteristics of publications matched versus not matched with UIDs as the match key.   Illustrate how publication sets formed by using UIDs as the match key may affect the bibliometric indicators: Number of publications, number of citations and the average number of citations per publication.   The objectives are addressed in a literature review and a case study. The literature review shows that only a few studies evaluate how well UIDs work as a match key. From the literature we identify four error types: Duplicate digital object identifiers (DOI), incorrect DOIs in reference lists and databases, DOIs not registered by the database where a bibliometric analysis is performed, and erroneous optical or special character recognition.   The case study explores the use of UIDs in the integration between the databases Pure and SciVal. Specifically journal publications in English are matched between the two databases. We find all error types except erroneous optical or special character recognition in our publication sets. In particular the duplicate DOIs constitute a problem for the calculation of bibliometric indicators as both keeping the duplicates to improve the reliability of citation counts and deleting them to improve the reliability of publication counts will distort the calculation of average number of citations per publication.   The use of UIDs as a match key in citation linking is implemented in many settings, and the availability of UIDs may become critical for the inclusion of a publication or a database in a bibliometric analysis.http://f1000research.com/articles/5-1539/v1Data SharingPublishing & Peer Review
spellingShingle Heidi Holst Madsen
Dicte Madsen
Marianne Gauffriau
Evaluation of unique identifiers used for citation linking [version 1; referees: 1 approved, 2 approved with reservations]
F1000Research
Data Sharing
Publishing & Peer Review
title Evaluation of unique identifiers used for citation linking [version 1; referees: 1 approved, 2 approved with reservations]
title_full Evaluation of unique identifiers used for citation linking [version 1; referees: 1 approved, 2 approved with reservations]
title_fullStr Evaluation of unique identifiers used for citation linking [version 1; referees: 1 approved, 2 approved with reservations]
title_full_unstemmed Evaluation of unique identifiers used for citation linking [version 1; referees: 1 approved, 2 approved with reservations]
title_short Evaluation of unique identifiers used for citation linking [version 1; referees: 1 approved, 2 approved with reservations]
title_sort evaluation of unique identifiers used for citation linking version 1 referees 1 approved 2 approved with reservations
topic Data Sharing
Publishing & Peer Review
url http://f1000research.com/articles/5-1539/v1
work_keys_str_mv AT heidiholstmadsen evaluationofuniqueidentifiersusedforcitationlinkingversion1referees1approved2approvedwithreservations
AT dictemadsen evaluationofuniqueidentifiersusedforcitationlinkingversion1referees1approved2approvedwithreservations
AT mariannegauffriau evaluationofuniqueidentifiersusedforcitationlinkingversion1referees1approved2approvedwithreservations