Data availability, reusability, and analytic reproducibility: evaluating the impact of a mandatory open data policy at the journal Cognition

Access to data is a critical feature of an efficient, progressive and ultimately self-correcting scientific ecosystem. But the extent to which in-principle benefits of data sharing are realized in practice is unclear. Crucially, it is largely unknown whether published findings can be reproduced by r...

Full description

Bibliographic Details
Main Authors: Tom E. Hardwicke, Maya B. Mathur, Kyle MacDonald, Gustav Nilsonne, George C. Banks, Mallory C. Kidwell, Alicia Hofelich Mohr, Elizabeth Clayton, Erica J. Yoon, Michael Henry Tessler, Richie L. Lenne, Sara Altman, Bria Long, Michael C. Frank
Format: Article
Language:English
Published: The Royal Society 2018-01-01
Series:Royal Society Open Science
Subjects:
Online Access:https://royalsocietypublishing.org/doi/pdf/10.1098/rsos.180448
_version_ 1818991810425389056
author Tom E. Hardwicke
Maya B. Mathur
Kyle MacDonald
Gustav Nilsonne
George C. Banks
Mallory C. Kidwell
Alicia Hofelich Mohr
Elizabeth Clayton
Erica J. Yoon
Michael Henry Tessler
Richie L. Lenne
Sara Altman
Bria Long
Michael C. Frank
author_facet Tom E. Hardwicke
Maya B. Mathur
Kyle MacDonald
Gustav Nilsonne
George C. Banks
Mallory C. Kidwell
Alicia Hofelich Mohr
Elizabeth Clayton
Erica J. Yoon
Michael Henry Tessler
Richie L. Lenne
Sara Altman
Bria Long
Michael C. Frank
author_sort Tom E. Hardwicke
collection DOAJ
description Access to data is a critical feature of an efficient, progressive and ultimately self-correcting scientific ecosystem. But the extent to which in-principle benefits of data sharing are realized in practice is unclear. Crucially, it is largely unknown whether published findings can be reproduced by repeating reported analyses upon shared data (‘analytic reproducibility’). To investigate this, we conducted an observational evaluation of a mandatory open data policy introduced at the journal Cognition. Interrupted time-series analyses indicated a substantial post-policy increase in data available statements (104/417, 25% pre-policy to 136/174, 78% post-policy), although not all data appeared reusable (23/104, 22% pre-policy to 85/136, 62%, post-policy). For 35 of the articles determined to have reusable data, we attempted to reproduce 1324 target values. Ultimately, 64 values could not be reproduced within a 10% margin of error. For 22 articles all target values were reproduced, but 11 of these required author assistance. For 13 articles at least one value could not be reproduced despite author assistance. Importantly, there were no clear indications that original conclusions were seriously impacted. Mandatory open data policies can increase the frequency and quality of data sharing. However, suboptimal data curation, unclear analysis specification and reporting errors can impede analytic reproducibility, undermining the utility of data sharing and the credibility of scientific findings.
first_indexed 2024-12-20T20:16:11Z
format Article
id doaj.art-9159a399ecf14bc3a5b4b2656e011149
institution Directory Open Access Journal
issn 2054-5703
language English
last_indexed 2024-12-20T20:16:11Z
publishDate 2018-01-01
publisher The Royal Society
record_format Article
series Royal Society Open Science
spelling doaj.art-9159a399ecf14bc3a5b4b2656e0111492022-12-21T19:27:43ZengThe Royal SocietyRoyal Society Open Science2054-57032018-01-015810.1098/rsos.180448180448Data availability, reusability, and analytic reproducibility: evaluating the impact of a mandatory open data policy at the journal CognitionTom E. HardwickeMaya B. MathurKyle MacDonaldGustav NilsonneGeorge C. BanksMallory C. KidwellAlicia Hofelich MohrElizabeth ClaytonErica J. YoonMichael Henry TesslerRichie L. LenneSara AltmanBria LongMichael C. FrankAccess to data is a critical feature of an efficient, progressive and ultimately self-correcting scientific ecosystem. But the extent to which in-principle benefits of data sharing are realized in practice is unclear. Crucially, it is largely unknown whether published findings can be reproduced by repeating reported analyses upon shared data (‘analytic reproducibility’). To investigate this, we conducted an observational evaluation of a mandatory open data policy introduced at the journal Cognition. Interrupted time-series analyses indicated a substantial post-policy increase in data available statements (104/417, 25% pre-policy to 136/174, 78% post-policy), although not all data appeared reusable (23/104, 22% pre-policy to 85/136, 62%, post-policy). For 35 of the articles determined to have reusable data, we attempted to reproduce 1324 target values. Ultimately, 64 values could not be reproduced within a 10% margin of error. For 22 articles all target values were reproduced, but 11 of these required author assistance. For 13 articles at least one value could not be reproduced despite author assistance. Importantly, there were no clear indications that original conclusions were seriously impacted. Mandatory open data policies can increase the frequency and quality of data sharing. However, suboptimal data curation, unclear analysis specification and reporting errors can impede analytic reproducibility, undermining the utility of data sharing and the credibility of scientific findings.https://royalsocietypublishing.org/doi/pdf/10.1098/rsos.180448open datareproducibilityopen sciencemeta-scienceinterrupted time seriesjournal policy
spellingShingle Tom E. Hardwicke
Maya B. Mathur
Kyle MacDonald
Gustav Nilsonne
George C. Banks
Mallory C. Kidwell
Alicia Hofelich Mohr
Elizabeth Clayton
Erica J. Yoon
Michael Henry Tessler
Richie L. Lenne
Sara Altman
Bria Long
Michael C. Frank
Data availability, reusability, and analytic reproducibility: evaluating the impact of a mandatory open data policy at the journal Cognition
Royal Society Open Science
open data
reproducibility
open science
meta-science
interrupted time series
journal policy
title Data availability, reusability, and analytic reproducibility: evaluating the impact of a mandatory open data policy at the journal Cognition
title_full Data availability, reusability, and analytic reproducibility: evaluating the impact of a mandatory open data policy at the journal Cognition
title_fullStr Data availability, reusability, and analytic reproducibility: evaluating the impact of a mandatory open data policy at the journal Cognition
title_full_unstemmed Data availability, reusability, and analytic reproducibility: evaluating the impact of a mandatory open data policy at the journal Cognition
title_short Data availability, reusability, and analytic reproducibility: evaluating the impact of a mandatory open data policy at the journal Cognition
title_sort data availability reusability and analytic reproducibility evaluating the impact of a mandatory open data policy at the journal cognition
topic open data
reproducibility
open science
meta-science
interrupted time series
journal policy
url https://royalsocietypublishing.org/doi/pdf/10.1098/rsos.180448
work_keys_str_mv AT tomehardwicke dataavailabilityreusabilityandanalyticreproducibilityevaluatingtheimpactofamandatoryopendatapolicyatthejournalcognition
AT mayabmathur dataavailabilityreusabilityandanalyticreproducibilityevaluatingtheimpactofamandatoryopendatapolicyatthejournalcognition
AT kylemacdonald dataavailabilityreusabilityandanalyticreproducibilityevaluatingtheimpactofamandatoryopendatapolicyatthejournalcognition
AT gustavnilsonne dataavailabilityreusabilityandanalyticreproducibilityevaluatingtheimpactofamandatoryopendatapolicyatthejournalcognition
AT georgecbanks dataavailabilityreusabilityandanalyticreproducibilityevaluatingtheimpactofamandatoryopendatapolicyatthejournalcognition
AT malloryckidwell dataavailabilityreusabilityandanalyticreproducibilityevaluatingtheimpactofamandatoryopendatapolicyatthejournalcognition
AT aliciahofelichmohr dataavailabilityreusabilityandanalyticreproducibilityevaluatingtheimpactofamandatoryopendatapolicyatthejournalcognition
AT elizabethclayton dataavailabilityreusabilityandanalyticreproducibilityevaluatingtheimpactofamandatoryopendatapolicyatthejournalcognition
AT ericajyoon dataavailabilityreusabilityandanalyticreproducibilityevaluatingtheimpactofamandatoryopendatapolicyatthejournalcognition
AT michaelhenrytessler dataavailabilityreusabilityandanalyticreproducibilityevaluatingtheimpactofamandatoryopendatapolicyatthejournalcognition
AT richiellenne dataavailabilityreusabilityandanalyticreproducibilityevaluatingtheimpactofamandatoryopendatapolicyatthejournalcognition
AT saraaltman dataavailabilityreusabilityandanalyticreproducibilityevaluatingtheimpactofamandatoryopendatapolicyatthejournalcognition
AT brialong dataavailabilityreusabilityandanalyticreproducibilityevaluatingtheimpactofamandatoryopendatapolicyatthejournalcognition
AT michaelcfrank dataavailabilityreusabilityandanalyticreproducibilityevaluatingtheimpactofamandatoryopendatapolicyatthejournalcognition