Administrative social science data: The challenge of reproducible research
Powerful new social science data resources are emerging. One particularly important source is administrative data, which were originally collected for organisational purposes but often contain information that is suitable for social science research. In this paper we outline the concept of reproduci...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SAGE Publishing
2016-12-01
|
Series: | Big Data & Society |
Online Access: | https://doi.org/10.1177/2053951716684143 |
_version_ | 1818448920826609664 |
---|---|
author | Christopher J Playford Vernon Gayle Roxanne Connelly Alasdair JG Gray |
author_facet | Christopher J Playford Vernon Gayle Roxanne Connelly Alasdair JG Gray |
author_sort | Christopher J Playford |
collection | DOAJ |
description | Powerful new social science data resources are emerging. One particularly important source is administrative data, which were originally collected for organisational purposes but often contain information that is suitable for social science research. In this paper we outline the concept of reproducible research in relation to micro-level administrative social science data. Our central claim is that a planned and organised workflow is essential for high quality research using micro-level administrative social science data. We argue that it is essential for researchers to share research code, because code sharing enables the elements of reproducible research. First, it enables results to be duplicated and therefore allows the accuracy and validity of analyses to be evaluated. Second, it facilitates further tests of the robustness of the original piece of research. Drawing on insights from computer science and other disciplines that have been engaged in e-Research we discuss and advocate the use of Git repositories to provide a useable and effective solution to research code sharing and rendering social science research using micro-level administrative data reproducible. |
first_indexed | 2024-12-14T20:27:11Z |
format | Article |
id | doaj.art-6c144ab1e6b74714a50172b3663375f4 |
institution | Directory Open Access Journal |
issn | 2053-9517 |
language | English |
last_indexed | 2024-12-14T20:27:11Z |
publishDate | 2016-12-01 |
publisher | SAGE Publishing |
record_format | Article |
series | Big Data & Society |
spelling | doaj.art-6c144ab1e6b74714a50172b3663375f42022-12-21T22:48:36ZengSAGE PublishingBig Data & Society2053-95172016-12-01310.1177/205395171668414310.1177_2053951716684143Administrative social science data: The challenge of reproducible researchChristopher J Playford0Vernon Gayle1Roxanne Connelly2Alasdair JG Gray3Administrative Data Research Centre – Scotland, School of Social and Political Science, University of Edinburgh, UKAdministrative Data Research Centre – Scotland, School of Social and Political Science, University of Edinburgh, UKDepartment of Sociology, University of Warwick, UKSchool of Mathematical & Computer Sciences, Heriot-Watt University, UKPowerful new social science data resources are emerging. One particularly important source is administrative data, which were originally collected for organisational purposes but often contain information that is suitable for social science research. In this paper we outline the concept of reproducible research in relation to micro-level administrative social science data. Our central claim is that a planned and organised workflow is essential for high quality research using micro-level administrative social science data. We argue that it is essential for researchers to share research code, because code sharing enables the elements of reproducible research. First, it enables results to be duplicated and therefore allows the accuracy and validity of analyses to be evaluated. Second, it facilitates further tests of the robustness of the original piece of research. Drawing on insights from computer science and other disciplines that have been engaged in e-Research we discuss and advocate the use of Git repositories to provide a useable and effective solution to research code sharing and rendering social science research using micro-level administrative data reproducible.https://doi.org/10.1177/2053951716684143 |
spellingShingle | Christopher J Playford Vernon Gayle Roxanne Connelly Alasdair JG Gray Administrative social science data: The challenge of reproducible research Big Data & Society |
title | Administrative social science data: The challenge of reproducible research |
title_full | Administrative social science data: The challenge of reproducible research |
title_fullStr | Administrative social science data: The challenge of reproducible research |
title_full_unstemmed | Administrative social science data: The challenge of reproducible research |
title_short | Administrative social science data: The challenge of reproducible research |
title_sort | administrative social science data the challenge of reproducible research |
url | https://doi.org/10.1177/2053951716684143 |
work_keys_str_mv | AT christopherjplayford administrativesocialsciencedatathechallengeofreproducibleresearch AT vernongayle administrativesocialsciencedatathechallengeofreproducibleresearch AT roxanneconnelly administrativesocialsciencedatathechallengeofreproducibleresearch AT alasdairjggray administrativesocialsciencedatathechallengeofreproducibleresearch |