Same data, different analysts: variation in effect sizes due to analytical decisions in ecology and evolutionary biology

Although variation in effect sizes and predicted values among studies of similar phenomena is inevitable, such variation far exceeds what might be produced by sampling error alone. One possible explanation for variation among results is differences among researchers in the decisions they make regard...

Full description

Bibliographic Details
Main Authors: Gould, E, Fraser, HS, Parker, TH, Nakagawa, S, Griffith, SC, Vesk, PA, Fidler, F, Hamilton, DG, Abbey-Lee, RN, Abbott, JK, Aguirre, LA, Alcaraz, C, Aloni, I, Altschul, D, Arekar, K, Atkins, JW, Atkinson, J, Baker, CM, Barrett, M, Bell, K, Bello, SK, Beltrán, I, Berauer, BJ, Bertram, MG, Wood, A
Format: Journal article
Language:English
Published: BioMed Central 2025
_version_ 1824459256444747776
author Gould, E
Fraser, HS
Parker, TH
Nakagawa, S
Griffith, SC
Vesk, PA
Fidler, F
Hamilton, DG
Abbey-Lee, RN
Abbott, JK
Aguirre, LA
Alcaraz, C
Aloni, I
Altschul, D
Arekar, K
Atkins, JW
Atkinson, J
Baker, CM
Barrett, M
Bell, K
Bello, SK
Beltrán, I
Berauer, BJ
Bertram, MG
Wood, A
author_facet Gould, E
Fraser, HS
Parker, TH
Nakagawa, S
Griffith, SC
Vesk, PA
Fidler, F
Hamilton, DG
Abbey-Lee, RN
Abbott, JK
Aguirre, LA
Alcaraz, C
Aloni, I
Altschul, D
Arekar, K
Atkins, JW
Atkinson, J
Baker, CM
Barrett, M
Bell, K
Bello, SK
Beltrán, I
Berauer, BJ
Bertram, MG
Wood, A
author_sort Gould, E
collection OXFORD
description Although variation in effect sizes and predicted values among studies of similar phenomena is inevitable, such variation far exceeds what might be produced by sampling error alone. One possible explanation for variation among results is differences among researchers in the decisions they make regarding statistical analyses. A growing array of studies has explored this analytical variability in different fields and has found substantial variability among results despite analysts having the same data and research question. Many of these studies have been in the social sciences, but one small “many analyst” study found similar variability in ecology. We expanded the scope of this prior work by implementing a large-scale empirical exploration of the variation in effect sizes and model predictions generated by the analytical decisions of different researchers in ecology and evolutionary biology. We used two unpublished datasets, one from evolutionary ecology (blue tit, Cyanistes caeruleus, to compare sibling number and nestling growth) and one from conservation ecology (Eucalyptus, to compare grass cover and tree seedling recruitment). The project leaders recruited 174 analyst teams, comprising 246 analysts, to investigate the answers to prespecified research questions. Analyses conducted by these teams yielded 141 usable effects (compatible with our meta-analyses and with all necessary information provided) for the blue tit dataset, and 85 usable effects for the Eucalyptus dataset. We found substantial heterogeneity among results for both datasets, although the patterns of variation differed between them. For the blue tit analyses, the average effect was convincingly negative, with less growth for nestlings living with more siblings, but there was near continuous variation in effect size from large negative effects to effects near zero, and even effects crossing the traditional threshold of statistical significance in the opposite direction. In contrast, the average relationship between grass cover and Eucalyptus seedling number was only slightly negative and not convincingly different from zero, and most effects ranged from weakly negative to weakly positive, with about a third of effects crossing the traditional threshold of significance in one direction or the other. However, there were also several striking outliers in the Eucalyptus dataset, with effects far from zero. For both datasets, we found substantial variation in the variable selection and random effects structures among analyses, as well as in the ratings of the analytical methods by peer reviewers, but we found no strong relationship between any of these and deviation from the meta-analytic mean. In other words, analyses with results that were far from the mean were no more or less likely to have dissimilar variable sets, use random effects in their models, or receive poor peer reviews than those analyses that found results that were close to the mean. The existence of substantial variability among analysis outcomes raises important questions about how ecologists and evolutionary biologists should interpret published results, and how they should conduct analyses in the future.
first_indexed 2025-02-19T04:38:54Z
format Journal article
id oxford-uuid:08c8ce57-40bb-4f20-b162-6839b82353b0
institution University of Oxford
language English
last_indexed 2025-02-19T04:38:54Z
publishDate 2025
publisher BioMed Central
record_format dspace
spelling oxford-uuid:08c8ce57-40bb-4f20-b162-6839b82353b02025-02-15T20:03:45ZSame data, different analysts: variation in effect sizes due to analytical decisions in ecology and evolutionary biologyJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:08c8ce57-40bb-4f20-b162-6839b82353b0EnglishJisc Publications RouterBioMed Central2025Gould, EFraser, HSParker, THNakagawa, SGriffith, SCVesk, PAFidler, FHamilton, DGAbbey-Lee, RNAbbott, JKAguirre, LAAlcaraz, CAloni, IAltschul, DArekar, KAtkins, JWAtkinson, JBaker, CMBarrett, MBell, KBello, SKBeltrán, IBerauer, BJBertram, MGWood, AAlthough variation in effect sizes and predicted values among studies of similar phenomena is inevitable, such variation far exceeds what might be produced by sampling error alone. One possible explanation for variation among results is differences among researchers in the decisions they make regarding statistical analyses. A growing array of studies has explored this analytical variability in different fields and has found substantial variability among results despite analysts having the same data and research question. Many of these studies have been in the social sciences, but one small “many analyst” study found similar variability in ecology. We expanded the scope of this prior work by implementing a large-scale empirical exploration of the variation in effect sizes and model predictions generated by the analytical decisions of different researchers in ecology and evolutionary biology. We used two unpublished datasets, one from evolutionary ecology (blue tit, Cyanistes caeruleus, to compare sibling number and nestling growth) and one from conservation ecology (Eucalyptus, to compare grass cover and tree seedling recruitment). The project leaders recruited 174 analyst teams, comprising 246 analysts, to investigate the answers to prespecified research questions. Analyses conducted by these teams yielded 141 usable effects (compatible with our meta-analyses and with all necessary information provided) for the blue tit dataset, and 85 usable effects for the Eucalyptus dataset. We found substantial heterogeneity among results for both datasets, although the patterns of variation differed between them. For the blue tit analyses, the average effect was convincingly negative, with less growth for nestlings living with more siblings, but there was near continuous variation in effect size from large negative effects to effects near zero, and even effects crossing the traditional threshold of statistical significance in the opposite direction. In contrast, the average relationship between grass cover and Eucalyptus seedling number was only slightly negative and not convincingly different from zero, and most effects ranged from weakly negative to weakly positive, with about a third of effects crossing the traditional threshold of significance in one direction or the other. However, there were also several striking outliers in the Eucalyptus dataset, with effects far from zero. For both datasets, we found substantial variation in the variable selection and random effects structures among analyses, as well as in the ratings of the analytical methods by peer reviewers, but we found no strong relationship between any of these and deviation from the meta-analytic mean. In other words, analyses with results that were far from the mean were no more or less likely to have dissimilar variable sets, use random effects in their models, or receive poor peer reviews than those analyses that found results that were close to the mean. The existence of substantial variability among analysis outcomes raises important questions about how ecologists and evolutionary biologists should interpret published results, and how they should conduct analyses in the future.
spellingShingle Gould, E
Fraser, HS
Parker, TH
Nakagawa, S
Griffith, SC
Vesk, PA
Fidler, F
Hamilton, DG
Abbey-Lee, RN
Abbott, JK
Aguirre, LA
Alcaraz, C
Aloni, I
Altschul, D
Arekar, K
Atkins, JW
Atkinson, J
Baker, CM
Barrett, M
Bell, K
Bello, SK
Beltrán, I
Berauer, BJ
Bertram, MG
Wood, A
Same data, different analysts: variation in effect sizes due to analytical decisions in ecology and evolutionary biology
title Same data, different analysts: variation in effect sizes due to analytical decisions in ecology and evolutionary biology
title_full Same data, different analysts: variation in effect sizes due to analytical decisions in ecology and evolutionary biology
title_fullStr Same data, different analysts: variation in effect sizes due to analytical decisions in ecology and evolutionary biology
title_full_unstemmed Same data, different analysts: variation in effect sizes due to analytical decisions in ecology and evolutionary biology
title_short Same data, different analysts: variation in effect sizes due to analytical decisions in ecology and evolutionary biology
title_sort same data different analysts variation in effect sizes due to analytical decisions in ecology and evolutionary biology
work_keys_str_mv AT goulde samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT fraserhs samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT parkerth samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT nakagawas samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT griffithsc samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT veskpa samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT fidlerf samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT hamiltondg samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT abbeyleern samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT abbottjk samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT aguirrela samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT alcarazc samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT alonii samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT altschuld samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT arekark samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT atkinsjw samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT atkinsonj samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT bakercm samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT barrettm samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT bellk samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT bellosk samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT beltrani samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT berauerbj samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT bertrammg samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology
AT wooda samedatadifferentanalystsvariationineffectsizesduetoanalyticaldecisionsinecologyandevolutionarybiology