When do Risky Substitutions Improve Match Outcomes? Interpreting Causal Machine Learning Models for Insights

We investigate the impact of risk-taking on football match outcomes within the context of substitutions. The study uses an extensive dataset of 75 thousand substitutions spanning eight seasons across six leagues, encompassing team, match and manager characteristics, and interim results. Given the in...

Full description

Bibliographic Details
Main Authors: Ozden Gur Ali, Emrah Yilmaz
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10320317/
_version_ 1797319534838808576
author Ozden Gur Ali
Emrah Yilmaz
author_facet Ozden Gur Ali
Emrah Yilmaz
author_sort Ozden Gur Ali
collection DOAJ
description We investigate the impact of risk-taking on football match outcomes within the context of substitutions. The study uses an extensive dataset of 75 thousand substitutions spanning eight seasons across six leagues, encompassing team, match and manager characteristics, and interim results. Given the intricate web of potential drivers and the absence of a well-defined theoretical framework, conventional econometric approaches fall short. Instead, Causal Forest, a black-box causal machine learning (ML) technique, was adopted to estimate the heterogeneous treatment effects, coupled with robustness checks. Its estimates are explained through Generalized Additive Models (GAM), visualizing how different factors affect the risk-taking probability and moderate the treatment effect of risk-taking on match outcomes. Fidelity of the explanation is improved by segmenting the dataset to capture strong interaction effects. The study reveals that risk-taking propensity peaks when a team trails by 2–3 goals and tapers when leading by the same margin. Considering impact of risk-taking on match outcomes, interestingly, younger managers exhibit superior performance compared to middle-aged ones, while older counterparts excel in later substitutions. The manager’s tenure with the team increases the impact, tapering in the long run. Earlier substitutions, stronger teams have greater impact when risk-taking. Teams leading by one goal stand to benefit most from risk-taking in substitutions. This research underscores the synergistic potential of black-box causal machine learning and interpretable models, given large representative datasets with all confounders. Insights into football risk-taking dynamics have implications for managerial strategies. Beyond football, it is noteworthy that manager age and tenure with the team are top first and third factors determining the effectiveness of a risky decision.
first_indexed 2024-03-08T04:09:21Z
format Article
id doaj.art-2405bc4b6f2140b6aa151fc516ea4c69
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-08T04:09:21Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-2405bc4b6f2140b6aa151fc516ea4c692024-02-09T00:01:04ZengIEEEIEEE Access2169-35362023-01-011113135113136110.1109/ACCESS.2023.333387810320317When do Risky Substitutions Improve Match Outcomes? Interpreting Causal Machine Learning Models for InsightsOzden Gur Ali0https://orcid.org/0000-0002-9409-4532Emrah Yilmaz1College of Administrative Sciences and Economics, Koç University, Sariyer, İstanbul, TurkeyCollege of Administrative Sciences and Economics, Koç University, Sariyer, İstanbul, TurkeyWe investigate the impact of risk-taking on football match outcomes within the context of substitutions. The study uses an extensive dataset of 75 thousand substitutions spanning eight seasons across six leagues, encompassing team, match and manager characteristics, and interim results. Given the intricate web of potential drivers and the absence of a well-defined theoretical framework, conventional econometric approaches fall short. Instead, Causal Forest, a black-box causal machine learning (ML) technique, was adopted to estimate the heterogeneous treatment effects, coupled with robustness checks. Its estimates are explained through Generalized Additive Models (GAM), visualizing how different factors affect the risk-taking probability and moderate the treatment effect of risk-taking on match outcomes. Fidelity of the explanation is improved by segmenting the dataset to capture strong interaction effects. The study reveals that risk-taking propensity peaks when a team trails by 2–3 goals and tapers when leading by the same margin. Considering impact of risk-taking on match outcomes, interestingly, younger managers exhibit superior performance compared to middle-aged ones, while older counterparts excel in later substitutions. The manager’s tenure with the team increases the impact, tapering in the long run. Earlier substitutions, stronger teams have greater impact when risk-taking. Teams leading by one goal stand to benefit most from risk-taking in substitutions. This research underscores the synergistic potential of black-box causal machine learning and interpretable models, given large representative datasets with all confounders. Insights into football risk-taking dynamics have implications for managerial strategies. Beyond football, it is noteworthy that manager age and tenure with the team are top first and third factors determining the effectiveness of a risky decision.https://ieeexplore.ieee.org/document/10320317/Football analyticsrisk-takingdecision makingheterogeneous treatment effectcausal forestgeneralized additive models (GAM)
spellingShingle Ozden Gur Ali
Emrah Yilmaz
When do Risky Substitutions Improve Match Outcomes? Interpreting Causal Machine Learning Models for Insights
IEEE Access
Football analytics
risk-taking
decision making
heterogeneous treatment effect
causal forest
generalized additive models (GAM)
title When do Risky Substitutions Improve Match Outcomes? Interpreting Causal Machine Learning Models for Insights
title_full When do Risky Substitutions Improve Match Outcomes? Interpreting Causal Machine Learning Models for Insights
title_fullStr When do Risky Substitutions Improve Match Outcomes? Interpreting Causal Machine Learning Models for Insights
title_full_unstemmed When do Risky Substitutions Improve Match Outcomes? Interpreting Causal Machine Learning Models for Insights
title_short When do Risky Substitutions Improve Match Outcomes? Interpreting Causal Machine Learning Models for Insights
title_sort when do risky substitutions improve match outcomes interpreting causal machine learning models for insights
topic Football analytics
risk-taking
decision making
heterogeneous treatment effect
causal forest
generalized additive models (GAM)
url https://ieeexplore.ieee.org/document/10320317/
work_keys_str_mv AT ozdengurali whendoriskysubstitutionsimprovematchoutcomesinterpretingcausalmachinelearningmodelsforinsights
AT emrahyilmaz whendoriskysubstitutionsimprovematchoutcomesinterpretingcausalmachinelearningmodelsforinsights