Digital Object Identifiers (DOIs) Prove Highly Effective for Long-Term Data Availability in PLOS ONE

A Review of: Federer, L. M. (2022). Long-term availability of data associated with articles in PLOS ONE. PLOS ONE 17(8), Article e0272845. https://doi.org/10.1371/journal.pone.0272845 Objective – To retrieve a range of PLOS ONE data availability statements and quantify their ability to point to...

Full description

Bibliographic Details
Main Author: Hilary Jasmin
Format: Article
Language:English
Published: University of Alberta 2023-09-01
Series:Evidence Based Library and Information Practice
Online Access:https://journals.library.ualberta.ca/eblip/index.php/EBLIP/article/view/30378
_version_ 1797660918954328064
author Hilary Jasmin
author_facet Hilary Jasmin
author_sort Hilary Jasmin
collection DOAJ
description A Review of: Federer, L. M. (2022). Long-term availability of data associated with articles in PLOS ONE. PLOS ONE 17(8), Article e0272845. https://doi.org/10.1371/journal.pone.0272845 Objective – To retrieve a range of PLOS ONE data availability statements and quantify their ability to point to the study data efficiently and accurately. Research questions focused on availability over time, availability of URLs versus DOIs, the ability to locate resources using the data availability statement and availability based on data sharing method. Design – Observational study. Setting – PLOS ONE archive. Subjects – A corpus of 47,593 data availability statements from research articles in PLOS ONE between March 1, 2014, and May 31, 2016. Methods – Use of custom R scripts to retrieve 47,593 data availability statements; of these, 6,912 (14.5%) contained at least one URL or DOI. Once these links were extracted, R scripts were run to fetch the resources and record HTTP status codes to determine if the resource was discoverable. To address the potential for the DOI or URL to fetch but not actually contain the appropriate data, the researchers selected at random and manually retrieved the data for 350 URLs and 350 DOIs. Main Results – Of the unique URLs, 75% were able to be automatically retrieved by custom R scripts. In the manual sample of 350 URLs, which was used to test for accuracy of the URLs in containing the data, there was a 78% retrieval rate. Of the unique DOIs, 90% were able to be automatically retrieved by custom R scripts. The manual sample of 350 DOIs had a 98% retrieval rate. Conclusion – DOIs, especially those linked with a repository, had the highest rate of success in retrieving the data attached to the article. While URLs were better than no link at all, URLs are susceptible to content drift and need more management for long-term data availability.
first_indexed 2024-03-11T18:37:48Z
format Article
id doaj.art-5dd1f481208c45c7bce78d71b9606069
institution Directory Open Access Journal
issn 1715-720X
language English
last_indexed 2024-03-11T18:37:48Z
publishDate 2023-09-01
publisher University of Alberta
record_format Article
series Evidence Based Library and Information Practice
spelling doaj.art-5dd1f481208c45c7bce78d71b96060692023-10-12T16:27:59ZengUniversity of AlbertaEvidence Based Library and Information Practice1715-720X2023-09-0118310.18438/eblip30378Digital Object Identifiers (DOIs) Prove Highly Effective for Long-Term Data Availability in PLOS ONEHilary Jasmin0The University of Tennessee Health Science Center, Memphis, Tennessee, United States of America A Review of: Federer, L. M. (2022). Long-term availability of data associated with articles in PLOS ONE. PLOS ONE 17(8), Article e0272845. https://doi.org/10.1371/journal.pone.0272845 Objective – To retrieve a range of PLOS ONE data availability statements and quantify their ability to point to the study data efficiently and accurately. Research questions focused on availability over time, availability of URLs versus DOIs, the ability to locate resources using the data availability statement and availability based on data sharing method. Design – Observational study. Setting – PLOS ONE archive. Subjects – A corpus of 47,593 data availability statements from research articles in PLOS ONE between March 1, 2014, and May 31, 2016. Methods – Use of custom R scripts to retrieve 47,593 data availability statements; of these, 6,912 (14.5%) contained at least one URL or DOI. Once these links were extracted, R scripts were run to fetch the resources and record HTTP status codes to determine if the resource was discoverable. To address the potential for the DOI or URL to fetch but not actually contain the appropriate data, the researchers selected at random and manually retrieved the data for 350 URLs and 350 DOIs. Main Results – Of the unique URLs, 75% were able to be automatically retrieved by custom R scripts. In the manual sample of 350 URLs, which was used to test for accuracy of the URLs in containing the data, there was a 78% retrieval rate. Of the unique DOIs, 90% were able to be automatically retrieved by custom R scripts. The manual sample of 350 DOIs had a 98% retrieval rate. Conclusion – DOIs, especially those linked with a repository, had the highest rate of success in retrieving the data attached to the article. While URLs were better than no link at all, URLs are susceptible to content drift and need more management for long-term data availability. https://journals.library.ualberta.ca/eblip/index.php/EBLIP/article/view/30378
spellingShingle Hilary Jasmin
Digital Object Identifiers (DOIs) Prove Highly Effective for Long-Term Data Availability in PLOS ONE
Evidence Based Library and Information Practice
title Digital Object Identifiers (DOIs) Prove Highly Effective for Long-Term Data Availability in PLOS ONE
title_full Digital Object Identifiers (DOIs) Prove Highly Effective for Long-Term Data Availability in PLOS ONE
title_fullStr Digital Object Identifiers (DOIs) Prove Highly Effective for Long-Term Data Availability in PLOS ONE
title_full_unstemmed Digital Object Identifiers (DOIs) Prove Highly Effective for Long-Term Data Availability in PLOS ONE
title_short Digital Object Identifiers (DOIs) Prove Highly Effective for Long-Term Data Availability in PLOS ONE
title_sort digital object identifiers dois prove highly effective for long term data availability in plos one
url https://journals.library.ualberta.ca/eblip/index.php/EBLIP/article/view/30378
work_keys_str_mv AT hilaryjasmin digitalobjectidentifiersdoisprovehighlyeffectiveforlongtermdataavailabilityinplosone