Discovery and Reuse of Open Datasets: An Exploratory Study

Objective: This article analyzes twenty cited or downloaded datasets and the repositories that house them, in order to produce insights that can be used by academic libraries to encourage discovery and reuse of research data in institutional repositories. Methods: Using Thomson Reuters’ Data Cita...

Full description

Bibliographic Details
Main Authors: Sara, Leila, Susan
Format: Article
Language:English
Published: UMass Chan Medical School, Lamar Soutter Library 2016-07-01
Series:Journal of eScience Librarianship
Subjects:
Online Access:http://escholarship.umassmed.edu/jeslib/vol5/iss1/5/
_version_ 1828075005607411712
author Sara
Leila
Susan
author_facet Sara
Leila
Susan
author_sort Sara
collection DOAJ
description Objective: This article analyzes twenty cited or downloaded datasets and the repositories that house them, in order to produce insights that can be used by academic libraries to encourage discovery and reuse of research data in institutional repositories. Methods: Using Thomson Reuters’ Data Citation Index and repository download statistics, we identified twenty cited/downloaded datasets. We documented the characteristics of the cited/downloaded datasets and their corresponding repositories in a self-designed rubric. The rubric includes six major categories: basic information; funding agency and journal information; linking and sharing; factors to encourage reuse; repository characteristics; and data description. Results: Our small-scale study suggests that cited/downloaded datasets generally comply with basic recommendations for facilitating reuse: data are documented well; formatted for use with a variety of software; and shared in established, open access repositories. Three significant factors also appear to contribute to dataset discovery: publishing in discipline-specific repositories; indexing in more than one location on the web; and using persistent identifiers. The cited/downloaded datasets in our analysis came from a few specific disciplines, and tended to be funded by agencies with data publication mandates. Conclusions: The results of this exploratory research provide insights that can inform academic librarians as they work to encourage discovery and reuse of institutional datasets. Our analysis also suggests areas in which academic librarians can target open data advocacy in their communities in order to begin to build open data success stories that will fuel future advocacy efforts.
first_indexed 2024-04-11T01:51:39Z
format Article
id doaj.art-193282c9244c43a1a0d89ee6ce186534
institution Directory Open Access Journal
issn 2161-3974
language English
last_indexed 2024-04-11T01:51:39Z
publishDate 2016-07-01
publisher UMass Chan Medical School, Lamar Soutter Library
record_format Article
series Journal of eScience Librarianship
spelling doaj.art-193282c9244c43a1a0d89ee6ce1865342023-01-03T06:08:23ZengUMass Chan Medical School, Lamar Soutter LibraryJournal of eScience Librarianship2161-39742016-07-0151e109110.7191/jeslib.2016.1091Discovery and Reuse of Open Datasets: An Exploratory StudySara0Leila1Susan2MannheimerStermanBordaObjective: This article analyzes twenty cited or downloaded datasets and the repositories that house them, in order to produce insights that can be used by academic libraries to encourage discovery and reuse of research data in institutional repositories. Methods: Using Thomson Reuters’ Data Citation Index and repository download statistics, we identified twenty cited/downloaded datasets. We documented the characteristics of the cited/downloaded datasets and their corresponding repositories in a self-designed rubric. The rubric includes six major categories: basic information; funding agency and journal information; linking and sharing; factors to encourage reuse; repository characteristics; and data description. Results: Our small-scale study suggests that cited/downloaded datasets generally comply with basic recommendations for facilitating reuse: data are documented well; formatted for use with a variety of software; and shared in established, open access repositories. Three significant factors also appear to contribute to dataset discovery: publishing in discipline-specific repositories; indexing in more than one location on the web; and using persistent identifiers. The cited/downloaded datasets in our analysis came from a few specific disciplines, and tended to be funded by agencies with data publication mandates. Conclusions: The results of this exploratory research provide insights that can inform academic librarians as they work to encourage discovery and reuse of institutional datasets. Our analysis also suggests areas in which academic librarians can target open data advocacy in their communities in order to begin to build open data success stories that will fuel future advocacy efforts.http://escholarship.umassmed.edu/jeslib/vol5/iss1/5/open datadata discoverydata reuseinstitutional data repositories
spellingShingle Sara
Leila
Susan
Discovery and Reuse of Open Datasets: An Exploratory Study
Journal of eScience Librarianship
open data
data discovery
data reuse
institutional data repositories
title Discovery and Reuse of Open Datasets: An Exploratory Study
title_full Discovery and Reuse of Open Datasets: An Exploratory Study
title_fullStr Discovery and Reuse of Open Datasets: An Exploratory Study
title_full_unstemmed Discovery and Reuse of Open Datasets: An Exploratory Study
title_short Discovery and Reuse of Open Datasets: An Exploratory Study
title_sort discovery and reuse of open datasets an exploratory study
topic open data
data discovery
data reuse
institutional data repositories
url http://escholarship.umassmed.edu/jeslib/vol5/iss1/5/
work_keys_str_mv AT sara discoveryandreuseofopendatasetsanexploratorystudy
AT leila discoveryandreuseofopendatasetsanexploratorystudy
AT susan discoveryandreuseofopendatasetsanexploratorystudy