Administrative social science data: The challenge of reproducible research

Powerful new social science data resources are emerging. One particularly important source is administrative data, which were originally collected for organisational purposes but often contain information that is suitable for social science research. In this paper we outline the concept of reproduci...

Full description

Bibliographic Details
Main Authors: Christopher J Playford, Vernon Gayle, Roxanne Connelly, Alasdair JG Gray
Format: Article
Language:English
Published: SAGE Publishing 2016-12-01
Series:Big Data & Society
Online Access:https://doi.org/10.1177/2053951716684143
_version_ 1818448920826609664
author Christopher J Playford
Vernon Gayle
Roxanne Connelly
Alasdair JG Gray
author_facet Christopher J Playford
Vernon Gayle
Roxanne Connelly
Alasdair JG Gray
author_sort Christopher J Playford
collection DOAJ
description Powerful new social science data resources are emerging. One particularly important source is administrative data, which were originally collected for organisational purposes but often contain information that is suitable for social science research. In this paper we outline the concept of reproducible research in relation to micro-level administrative social science data. Our central claim is that a planned and organised workflow is essential for high quality research using micro-level administrative social science data. We argue that it is essential for researchers to share research code, because code sharing enables the elements of reproducible research. First, it enables results to be duplicated and therefore allows the accuracy and validity of analyses to be evaluated. Second, it facilitates further tests of the robustness of the original piece of research. Drawing on insights from computer science and other disciplines that have been engaged in e-Research we discuss and advocate the use of Git repositories to provide a useable and effective solution to research code sharing and rendering social science research using micro-level administrative data reproducible.
first_indexed 2024-12-14T20:27:11Z
format Article
id doaj.art-6c144ab1e6b74714a50172b3663375f4
institution Directory Open Access Journal
issn 2053-9517
language English
last_indexed 2024-12-14T20:27:11Z
publishDate 2016-12-01
publisher SAGE Publishing
record_format Article
series Big Data & Society
spelling doaj.art-6c144ab1e6b74714a50172b3663375f42022-12-21T22:48:36ZengSAGE PublishingBig Data & Society2053-95172016-12-01310.1177/205395171668414310.1177_2053951716684143Administrative social science data: The challenge of reproducible researchChristopher J Playford0Vernon Gayle1Roxanne Connelly2Alasdair JG Gray3Administrative Data Research Centre – Scotland, School of Social and Political Science, University of Edinburgh, UKAdministrative Data Research Centre – Scotland, School of Social and Political Science, University of Edinburgh, UKDepartment of Sociology, University of Warwick, UKSchool of Mathematical & Computer Sciences, Heriot-Watt University, UKPowerful new social science data resources are emerging. One particularly important source is administrative data, which were originally collected for organisational purposes but often contain information that is suitable for social science research. In this paper we outline the concept of reproducible research in relation to micro-level administrative social science data. Our central claim is that a planned and organised workflow is essential for high quality research using micro-level administrative social science data. We argue that it is essential for researchers to share research code, because code sharing enables the elements of reproducible research. First, it enables results to be duplicated and therefore allows the accuracy and validity of analyses to be evaluated. Second, it facilitates further tests of the robustness of the original piece of research. Drawing on insights from computer science and other disciplines that have been engaged in e-Research we discuss and advocate the use of Git repositories to provide a useable and effective solution to research code sharing and rendering social science research using micro-level administrative data reproducible.https://doi.org/10.1177/2053951716684143
spellingShingle Christopher J Playford
Vernon Gayle
Roxanne Connelly
Alasdair JG Gray
Administrative social science data: The challenge of reproducible research
Big Data & Society
title Administrative social science data: The challenge of reproducible research
title_full Administrative social science data: The challenge of reproducible research
title_fullStr Administrative social science data: The challenge of reproducible research
title_full_unstemmed Administrative social science data: The challenge of reproducible research
title_short Administrative social science data: The challenge of reproducible research
title_sort administrative social science data the challenge of reproducible research
url https://doi.org/10.1177/2053951716684143
work_keys_str_mv AT christopherjplayford administrativesocialsciencedatathechallengeofreproducibleresearch
AT vernongayle administrativesocialsciencedatathechallengeofreproducibleresearch
AT roxanneconnelly administrativesocialsciencedatathechallengeofreproducibleresearch
AT alasdairjggray administrativesocialsciencedatathechallengeofreproducibleresearch