Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data

The task repurposing of heterogeneous, distributed data for originally unintended research objectives is a non-trivial problem because the mappings required may not be precise. A particular case is clinical data collected for patient care being used for medical research. The fact that research repos...

Full description

Bibliographic Details
Main Authors: Richard Bache, Simon Miles, Bolaji Coker, Adel Taweel
Format: Article
Language:English
Published: University of Edinburgh 2013-11-01
Series:International Journal of Digital Curation
Online Access:http://129.215.67.233:80/ijdc/article/view/262
_version_ 1797402079144181760
author Richard Bache
Simon Miles
Bolaji Coker
Adel Taweel
author_facet Richard Bache
Simon Miles
Bolaji Coker
Adel Taweel
author_sort Richard Bache
collection DOAJ
description The task repurposing of heterogeneous, distributed data for originally unintended research objectives is a non-trivial problem because the mappings required may not be precise. A particular case is clinical data collected for patient care being used for medical research. The fact that research repositories will record data differently means that assumptions must be made as how to transform of this data. Records of provenance that document how this process has taken place will enable users of the data warehouse to utilise the data appropriately and ensure that future data added from another source is transformed using comparable assumptions. For a provenance-based approach to be reusable and supportable with software tools, the provenance records must use a well-defined model of the transformation process. In this paper, we propose such a model, including a classification of the individual ‘sub-functions’ that make up the overall transformation. This model enables meaningful provenance data to be generated automatically. A case study is used to illustrate this approach and an initial classification of transformations that alter the information is created.
first_indexed 2024-03-09T02:19:13Z
format Article
id doaj.art-cb91968693bf4255b2ad3742cf46e513
institution Directory Open Access Journal
issn 1746-8256
language English
last_indexed 2024-03-09T02:19:13Z
publishDate 2013-11-01
publisher University of Edinburgh
record_format Article
series International Journal of Digital Curation
spelling doaj.art-cb91968693bf4255b2ad3742cf46e5132023-12-06T20:02:39ZengUniversity of EdinburghInternational Journal of Digital Curation1746-82562013-11-0182Informative Provenance for Repurposed Data: A Case Study using Clinical Research DataRichard BacheSimon MilesBolaji CokerAdel TaweelThe task repurposing of heterogeneous, distributed data for originally unintended research objectives is a non-trivial problem because the mappings required may not be precise. A particular case is clinical data collected for patient care being used for medical research. The fact that research repositories will record data differently means that assumptions must be made as how to transform of this data. Records of provenance that document how this process has taken place will enable users of the data warehouse to utilise the data appropriately and ensure that future data added from another source is transformed using comparable assumptions. For a provenance-based approach to be reusable and supportable with software tools, the provenance records must use a well-defined model of the transformation process. In this paper, we propose such a model, including a classification of the individual ‘sub-functions’ that make up the overall transformation. This model enables meaningful provenance data to be generated automatically. A case study is used to illustrate this approach and an initial classification of transformations that alter the information is created.http://129.215.67.233:80/ijdc/article/view/262
spellingShingle Richard Bache
Simon Miles
Bolaji Coker
Adel Taweel
Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data
International Journal of Digital Curation
title Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data
title_full Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data
title_fullStr Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data
title_full_unstemmed Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data
title_short Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data
title_sort informative provenance for repurposed data a case study using clinical research data
url http://129.215.67.233:80/ijdc/article/view/262
work_keys_str_mv AT richardbache informativeprovenanceforrepurposeddataacasestudyusingclinicalresearchdata
AT simonmiles informativeprovenanceforrepurposeddataacasestudyusingclinicalresearchdata
AT bolajicoker informativeprovenanceforrepurposeddataacasestudyusingclinicalresearchdata
AT adeltaweel informativeprovenanceforrepurposeddataacasestudyusingclinicalresearchdata