Opening Up Climate Research: A Linked Data Approach to Publishing Data Provenance

Traditionally, the formal scientific output in most fields of natural science has been limited to peer-reviewed academic journal publications, with less attention paid to the chain of intermediate data results and their associated metadata, including provenance. In effect, this has constrained the r...

Full description

Bibliographic Details
Main Authors: Arif Shaon, Sarah Callaghan, Bryan Lawrence, Brian Matthews, Timothy Osborn, Colin Harpham, Andrew Woolf
Format: Article
Language:English
Published: University of Edinburgh 2012-03-01
Series:International Journal of Digital Curation
Online Access:https://ijdc.net/index.php/ijdc/article/view/223
_version_ 1797323834662060032
author Arif Shaon
Sarah Callaghan
Bryan Lawrence
Brian Matthews
Timothy Osborn
Colin Harpham
Andrew Woolf
author_facet Arif Shaon
Sarah Callaghan
Bryan Lawrence
Brian Matthews
Timothy Osborn
Colin Harpham
Andrew Woolf
author_sort Arif Shaon
collection DOAJ
description Traditionally, the formal scientific output in most fields of natural science has been limited to peer-reviewed academic journal publications, with less attention paid to the chain of intermediate data results and their associated metadata, including provenance. In effect, this has constrained the representation and verification of the data provenance to the confines of the related publications. Detailed knowledge of a dataset’s provenance is essential to establish the pedigree of the data for its effective re-use, and to avoid redundant re-enactment of the experiment or computation involved. It is increasingly important for open-access data to determine their authenticity and quality, especially considering the growing volumes of datasets appearing in the public domain. To address these issues, we present an approach that combines the Digital Object Identifier (DOI) – a widely adopted citation technique – with existing, widely adopted climate science data standards to formally publish detailed provenance of a climate research dataset as an associated scientific workflow. This is integrated with linked-data compliant data re-use standards (e.g. OAI-ORE) to enable a seamless link between a publication and the complete trail of lineage of the corresponding dataset, including the dataset itself.
first_indexed 2024-03-08T05:34:53Z
format Article
id doaj.art-4bddd4cd0c4d4871b2b34d20067f1438
institution Directory Open Access Journal
issn 1746-8256
language English
last_indexed 2024-03-08T05:34:53Z
publishDate 2012-03-01
publisher University of Edinburgh
record_format Article
series International Journal of Digital Curation
spelling doaj.art-4bddd4cd0c4d4871b2b34d20067f14382024-02-06T00:07:05ZengUniversity of EdinburghInternational Journal of Digital Curation1746-82562012-03-0171Opening Up Climate Research: A Linked Data Approach to Publishing Data ProvenanceArif ShaonSarah CallaghanBryan LawrenceBrian MatthewsTimothy OsbornColin HarphamAndrew WoolfTraditionally, the formal scientific output in most fields of natural science has been limited to peer-reviewed academic journal publications, with less attention paid to the chain of intermediate data results and their associated metadata, including provenance. In effect, this has constrained the representation and verification of the data provenance to the confines of the related publications. Detailed knowledge of a dataset’s provenance is essential to establish the pedigree of the data for its effective re-use, and to avoid redundant re-enactment of the experiment or computation involved. It is increasingly important for open-access data to determine their authenticity and quality, especially considering the growing volumes of datasets appearing in the public domain. To address these issues, we present an approach that combines the Digital Object Identifier (DOI) – a widely adopted citation technique – with existing, widely adopted climate science data standards to formally publish detailed provenance of a climate research dataset as an associated scientific workflow. This is integrated with linked-data compliant data re-use standards (e.g. OAI-ORE) to enable a seamless link between a publication and the complete trail of lineage of the corresponding dataset, including the dataset itself.https://ijdc.net/index.php/ijdc/article/view/223
spellingShingle Arif Shaon
Sarah Callaghan
Bryan Lawrence
Brian Matthews
Timothy Osborn
Colin Harpham
Andrew Woolf
Opening Up Climate Research: A Linked Data Approach to Publishing Data Provenance
International Journal of Digital Curation
title Opening Up Climate Research: A Linked Data Approach to Publishing Data Provenance
title_full Opening Up Climate Research: A Linked Data Approach to Publishing Data Provenance
title_fullStr Opening Up Climate Research: A Linked Data Approach to Publishing Data Provenance
title_full_unstemmed Opening Up Climate Research: A Linked Data Approach to Publishing Data Provenance
title_short Opening Up Climate Research: A Linked Data Approach to Publishing Data Provenance
title_sort opening up climate research a linked data approach to publishing data provenance
url https://ijdc.net/index.php/ijdc/article/view/223
work_keys_str_mv AT arifshaon openingupclimateresearchalinkeddataapproachtopublishingdataprovenance
AT sarahcallaghan openingupclimateresearchalinkeddataapproachtopublishingdataprovenance
AT bryanlawrence openingupclimateresearchalinkeddataapproachtopublishingdataprovenance
AT brianmatthews openingupclimateresearchalinkeddataapproachtopublishingdataprovenance
AT timothyosborn openingupclimateresearchalinkeddataapproachtopublishingdataprovenance
AT colinharpham openingupclimateresearchalinkeddataapproachtopublishingdataprovenance
AT andrewwoolf openingupclimateresearchalinkeddataapproachtopublishingdataprovenance