Harmonizing the Metadata Among Diverse Climate Change Datasets
One of the critical problems in the curation of research data is the harmonization of its internal metadata schemata. The value of harmonizing such data is well illustrated by the Berkeley Earth project, which successfully integrated into one metadata schema the raw climate datasets from a wide var...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
University of Edinburgh
2015-05-01
|
Series: | International Journal of Digital Curation |
Online Access: | https://ijdc.net/index.php/ijdc/article/view/367 |
_version_ | 1797323936636076032 |
---|---|
author | André Vellino |
author_facet | André Vellino |
author_sort | André Vellino |
collection | DOAJ |
description |
One of the critical problems in the curation of research data is the harmonization of its internal metadata schemata. The value of harmonizing such data is well illustrated by the Berkeley Earth project, which successfully integrated into one metadata schema the raw climate datasets from a wide variety geographical sources and time periods (250 years). Doing this enabled climate scientists to calculate a more accurate estimate of the recent changes in Earth’s average land surface temperatures and to ascertain the extent to which climate change is anthropogenic.
This paper surveys some of the approaches that have been taken to the integration of data schemata in general and examines some of the specific metadata features of the source surface temperature datasets that were harmonized by Berkeley Earth. The conclusion drawn from this analysis is that the original source data and the Berkeley Earth common format provides a promising training set on which to apply machine learning methods for replicating the human data integration process. This paper describes research in progress on a domain-independent approach to the metadata harmonization problem that could be applied to other fields of study and be incorporated into a data portal to enhance the discoverability and reuse of data from a broad range of data sources.
|
first_indexed | 2024-03-08T05:35:15Z |
format | Article |
id | doaj.art-b1ad47c4c3614f53bae037364b65fa02 |
institution | Directory Open Access Journal |
issn | 1746-8256 |
language | English |
last_indexed | 2024-03-08T05:35:15Z |
publishDate | 2015-05-01 |
publisher | University of Edinburgh |
record_format | Article |
series | International Journal of Digital Curation |
spelling | doaj.art-b1ad47c4c3614f53bae037364b65fa022024-02-06T00:06:27ZengUniversity of EdinburghInternational Journal of Digital Curation1746-82562015-05-01101Harmonizing the Metadata Among Diverse Climate Change DatasetsAndré Vellino One of the critical problems in the curation of research data is the harmonization of its internal metadata schemata. The value of harmonizing such data is well illustrated by the Berkeley Earth project, which successfully integrated into one metadata schema the raw climate datasets from a wide variety geographical sources and time periods (250 years). Doing this enabled climate scientists to calculate a more accurate estimate of the recent changes in Earth’s average land surface temperatures and to ascertain the extent to which climate change is anthropogenic. This paper surveys some of the approaches that have been taken to the integration of data schemata in general and examines some of the specific metadata features of the source surface temperature datasets that were harmonized by Berkeley Earth. The conclusion drawn from this analysis is that the original source data and the Berkeley Earth common format provides a promising training set on which to apply machine learning methods for replicating the human data integration process. This paper describes research in progress on a domain-independent approach to the metadata harmonization problem that could be applied to other fields of study and be incorporated into a data portal to enhance the discoverability and reuse of data from a broad range of data sources. https://ijdc.net/index.php/ijdc/article/view/367 |
spellingShingle | André Vellino Harmonizing the Metadata Among Diverse Climate Change Datasets International Journal of Digital Curation |
title | Harmonizing the Metadata Among Diverse Climate Change Datasets |
title_full | Harmonizing the Metadata Among Diverse Climate Change Datasets |
title_fullStr | Harmonizing the Metadata Among Diverse Climate Change Datasets |
title_full_unstemmed | Harmonizing the Metadata Among Diverse Climate Change Datasets |
title_short | Harmonizing the Metadata Among Diverse Climate Change Datasets |
title_sort | harmonizing the metadata among diverse climate change datasets |
url | https://ijdc.net/index.php/ijdc/article/view/367 |
work_keys_str_mv | AT andrevellino harmonizingthemetadataamongdiverseclimatechangedatasets |