Machine learning augmentation reduces prediction error in collective forecasting: development and validation across prediction markets with application to COVID eventsResearch in context

Summary: Background: The recent COVID-19 pandemic highlighted the challenges for traditional forecasting. Prediction markets are a promising way to generate collective forecasts and could potentially be enhanced if high-quality crowdsourced inputs were identified and preferentially weighted for lik...

Full description

Bibliographic Details
Main Authors: Alexander Gruen, Karl R. Mattingly, Ellen Morwitch, Frederik Bossaerts, Manning Clifford, Chad Nash, John P.A. Ioannidis, Anne-Louise Ponsonby
Format: Article
Language:English
Published: Elsevier 2023-10-01
Series:EBioMedicine
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352396423003493
_version_ 1827819965911138304
author Alexander Gruen
Karl R. Mattingly
Ellen Morwitch
Frederik Bossaerts
Manning Clifford
Chad Nash
John P.A. Ioannidis
Anne-Louise Ponsonby
author_facet Alexander Gruen
Karl R. Mattingly
Ellen Morwitch
Frederik Bossaerts
Manning Clifford
Chad Nash
John P.A. Ioannidis
Anne-Louise Ponsonby
author_sort Alexander Gruen
collection DOAJ
description Summary: Background: The recent COVID-19 pandemic highlighted the challenges for traditional forecasting. Prediction markets are a promising way to generate collective forecasts and could potentially be enhanced if high-quality crowdsourced inputs were identified and preferentially weighted for likely accuracy in real-time with machine learning. Methods: We aim to leverage human prediction markets with real-time machine weighting of likely higher accuracy trades to improve performance. The crowd sourced Almanis prediction market longitudinal platform (n = 1822) and Next Generation Social Science (NGS2) platform (n = 103) were utilised. Findings: A 43-feature model predicted accurate forecasters, those with top quintile relative Brier accuracy, with subsequent replication in two out-of-sample datasets (pboth <1 × 10−9). Trades graded by this model as having higher accuracy scores than others produced a greater AUC temporal gain in the overall market after vs before trade. Accuracy score-weighted forecasts had higher accuracy than market forecasts alone, particularly when the two systems disagreed by 5% or more for binary event prediction: the hybrid system demonstrating substantial % AUC gains of 13.2%, p = 1.35 × 10−14 and 13.8%, p = 0.003 in two out-of-sample datasets. When discordant, the hybrid model was correct for COVID-19 event occurrence 72.7% of the time vs 27.3% for market models, p = 0.007. This net classification benefit was replicated in the separate Almanis B dataset, p = 2.4 × 10−7. Interpretation: Real-time machine classification followed by weighting human trades according to likely accuracy improves collective forecasting performance. This could provide improved anticipation of and thus response to emerging risks. Funding: This work was supported by an AusIndustry R and D tax incentive program from the Department of Industry, Science, Energy and Resources, Australia, to SlowVoice Pty Ltd. (IR 2101990) and Fellowship (GNT 1110200) and Investigator grant (GNT 1197234) to A-L Ponsonby by the National Health and Medical Research Council of Australia.
first_indexed 2024-03-12T01:21:32Z
format Article
id doaj.art-765e063c58d14d58926bc5310f12ffd3
institution Directory Open Access Journal
issn 2352-3964
language English
last_indexed 2024-03-12T01:21:32Z
publishDate 2023-10-01
publisher Elsevier
record_format Article
series EBioMedicine
spelling doaj.art-765e063c58d14d58926bc5310f12ffd32023-09-13T04:25:09ZengElsevierEBioMedicine2352-39642023-10-0196104783Machine learning augmentation reduces prediction error in collective forecasting: development and validation across prediction markets with application to COVID eventsResearch in contextAlexander Gruen0Karl R. Mattingly1Ellen Morwitch2Frederik Bossaerts3Manning Clifford4Chad Nash5John P.A. Ioannidis6Anne-Louise Ponsonby7The Florey Institute of Neuroscience and Mental Health, Melbourne, AustraliaDysrupt Labs by SlowVoice, Melbourne, AustraliaThe Florey Institute of Neuroscience and Mental Health, Melbourne, AustraliaDysrupt Labs by SlowVoice, Melbourne, AustraliaBoston Consulting Group, Melbourne, AustraliaDysrupt Labs by SlowVoice, Melbourne, AustraliaStanford Prevention Research Center, Department of Medicine, and Departments of Epidemiology and Population Health, of Biomedical Data Science, and of Statistics, Stanford University, Meta-Research Innovation Center at Stanford, Stanford, CA, USAThe Florey Institute of Neuroscience and Mental Health, Melbourne, Australia; Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, Australia; Centre of Epidemiology and Biostatistics, School of Population and Global Health, University of Melbourne, Australia; Corresponding author. The Florey Institute of Neuroscience and Mental Health; Melbourne, Australia.Summary: Background: The recent COVID-19 pandemic highlighted the challenges for traditional forecasting. Prediction markets are a promising way to generate collective forecasts and could potentially be enhanced if high-quality crowdsourced inputs were identified and preferentially weighted for likely accuracy in real-time with machine learning. Methods: We aim to leverage human prediction markets with real-time machine weighting of likely higher accuracy trades to improve performance. The crowd sourced Almanis prediction market longitudinal platform (n = 1822) and Next Generation Social Science (NGS2) platform (n = 103) were utilised. Findings: A 43-feature model predicted accurate forecasters, those with top quintile relative Brier accuracy, with subsequent replication in two out-of-sample datasets (pboth <1 × 10−9). Trades graded by this model as having higher accuracy scores than others produced a greater AUC temporal gain in the overall market after vs before trade. Accuracy score-weighted forecasts had higher accuracy than market forecasts alone, particularly when the two systems disagreed by 5% or more for binary event prediction: the hybrid system demonstrating substantial % AUC gains of 13.2%, p = 1.35 × 10−14 and 13.8%, p = 0.003 in two out-of-sample datasets. When discordant, the hybrid model was correct for COVID-19 event occurrence 72.7% of the time vs 27.3% for market models, p = 0.007. This net classification benefit was replicated in the separate Almanis B dataset, p = 2.4 × 10−7. Interpretation: Real-time machine classification followed by weighting human trades according to likely accuracy improves collective forecasting performance. This could provide improved anticipation of and thus response to emerging risks. Funding: This work was supported by an AusIndustry R and D tax incentive program from the Department of Industry, Science, Energy and Resources, Australia, to SlowVoice Pty Ltd. (IR 2101990) and Fellowship (GNT 1110200) and Investigator grant (GNT 1197234) to A-L Ponsonby by the National Health and Medical Research Council of Australia.http://www.sciencedirect.com/science/article/pii/S2352396423003493Collective intelligenceForecast accuracyPrediction marketHuman-machine forecastCOVID
spellingShingle Alexander Gruen
Karl R. Mattingly
Ellen Morwitch
Frederik Bossaerts
Manning Clifford
Chad Nash
John P.A. Ioannidis
Anne-Louise Ponsonby
Machine learning augmentation reduces prediction error in collective forecasting: development and validation across prediction markets with application to COVID eventsResearch in context
EBioMedicine
Collective intelligence
Forecast accuracy
Prediction market
Human-machine forecast
COVID
title Machine learning augmentation reduces prediction error in collective forecasting: development and validation across prediction markets with application to COVID eventsResearch in context
title_full Machine learning augmentation reduces prediction error in collective forecasting: development and validation across prediction markets with application to COVID eventsResearch in context
title_fullStr Machine learning augmentation reduces prediction error in collective forecasting: development and validation across prediction markets with application to COVID eventsResearch in context
title_full_unstemmed Machine learning augmentation reduces prediction error in collective forecasting: development and validation across prediction markets with application to COVID eventsResearch in context
title_short Machine learning augmentation reduces prediction error in collective forecasting: development and validation across prediction markets with application to COVID eventsResearch in context
title_sort machine learning augmentation reduces prediction error in collective forecasting development and validation across prediction markets with application to covid eventsresearch in context
topic Collective intelligence
Forecast accuracy
Prediction market
Human-machine forecast
COVID
url http://www.sciencedirect.com/science/article/pii/S2352396423003493
work_keys_str_mv AT alexandergruen machinelearningaugmentationreducespredictionerrorincollectiveforecastingdevelopmentandvalidationacrosspredictionmarketswithapplicationtocovideventsresearchincontext
AT karlrmattingly machinelearningaugmentationreducespredictionerrorincollectiveforecastingdevelopmentandvalidationacrosspredictionmarketswithapplicationtocovideventsresearchincontext
AT ellenmorwitch machinelearningaugmentationreducespredictionerrorincollectiveforecastingdevelopmentandvalidationacrosspredictionmarketswithapplicationtocovideventsresearchincontext
AT frederikbossaerts machinelearningaugmentationreducespredictionerrorincollectiveforecastingdevelopmentandvalidationacrosspredictionmarketswithapplicationtocovideventsresearchincontext
AT manningclifford machinelearningaugmentationreducespredictionerrorincollectiveforecastingdevelopmentandvalidationacrosspredictionmarketswithapplicationtocovideventsresearchincontext
AT chadnash machinelearningaugmentationreducespredictionerrorincollectiveforecastingdevelopmentandvalidationacrosspredictionmarketswithapplicationtocovideventsresearchincontext
AT johnpaioannidis machinelearningaugmentationreducespredictionerrorincollectiveforecastingdevelopmentandvalidationacrosspredictionmarketswithapplicationtocovideventsresearchincontext
AT annelouiseponsonby machinelearningaugmentationreducespredictionerrorincollectiveforecastingdevelopmentandvalidationacrosspredictionmarketswithapplicationtocovideventsresearchincontext