Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data
The task repurposing of heterogeneous, distributed data for originally unintended research objectives is a non-trivial problem because the mappings required may not be precise. A particular case is clinical data collected for patient care being used for medical research. The fact that research repos...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
University of Edinburgh
2013-11-01
|
Series: | International Journal of Digital Curation |
Online Access: | https://ijdc.net/index.php/ijdc/article/view/262 |
_version_ | 1797323896806965248 |
---|---|
author | Richard Bache Simon Miles Bolaji Coker Adel Taweel |
author_facet | Richard Bache Simon Miles Bolaji Coker Adel Taweel |
author_sort | Richard Bache |
collection | DOAJ |
description | The task repurposing of heterogeneous, distributed data for originally unintended research objectives is a non-trivial problem because the mappings required may not be precise. A particular case is clinical data collected for patient care being used for medical research. The fact that research repositories will record data differently means that assumptions must be made as how to transform of this data. Records of provenance that document how this process has taken place will enable users of the data warehouse to utilise the data appropriately and ensure that future data added from another source is transformed using comparable assumptions. For a provenance-based approach to be reusable and supportable with software tools, the provenance records must use a well-defined model of the transformation process. In this paper, we propose such a model, including a classification of the individual ‘sub-functions’ that make up the overall transformation. This model enables meaningful provenance data to be generated automatically. A case study is used to illustrate this approach and an initial classification of transformations that alter the information is created. |
first_indexed | 2024-03-08T05:34:44Z |
format | Article |
id | doaj.art-2f0f67dbede647d1992060749175b1b2 |
institution | Directory Open Access Journal |
issn | 1746-8256 |
language | English |
last_indexed | 2024-03-08T05:34:44Z |
publishDate | 2013-11-01 |
publisher | University of Edinburgh |
record_format | Article |
series | International Journal of Digital Curation |
spelling | doaj.art-2f0f67dbede647d1992060749175b1b22024-02-06T00:06:53ZengUniversity of EdinburghInternational Journal of Digital Curation1746-82562013-11-0182Informative Provenance for Repurposed Data: A Case Study using Clinical Research DataRichard BacheSimon MilesBolaji CokerAdel TaweelThe task repurposing of heterogeneous, distributed data for originally unintended research objectives is a non-trivial problem because the mappings required may not be precise. A particular case is clinical data collected for patient care being used for medical research. The fact that research repositories will record data differently means that assumptions must be made as how to transform of this data. Records of provenance that document how this process has taken place will enable users of the data warehouse to utilise the data appropriately and ensure that future data added from another source is transformed using comparable assumptions. For a provenance-based approach to be reusable and supportable with software tools, the provenance records must use a well-defined model of the transformation process. In this paper, we propose such a model, including a classification of the individual ‘sub-functions’ that make up the overall transformation. This model enables meaningful provenance data to be generated automatically. A case study is used to illustrate this approach and an initial classification of transformations that alter the information is created.https://ijdc.net/index.php/ijdc/article/view/262 |
spellingShingle | Richard Bache Simon Miles Bolaji Coker Adel Taweel Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data International Journal of Digital Curation |
title | Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data |
title_full | Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data |
title_fullStr | Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data |
title_full_unstemmed | Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data |
title_short | Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data |
title_sort | informative provenance for repurposed data a case study using clinical research data |
url | https://ijdc.net/index.php/ijdc/article/view/262 |
work_keys_str_mv | AT richardbache informativeprovenanceforrepurposeddataacasestudyusingclinicalresearchdata AT simonmiles informativeprovenanceforrepurposeddataacasestudyusingclinicalresearchdata AT bolajicoker informativeprovenanceforrepurposeddataacasestudyusingclinicalresearchdata AT adeltaweel informativeprovenanceforrepurposeddataacasestudyusingclinicalresearchdata |