A Goal Scoring Probability Model for Shots Based on Synchronized Positional and Event Data in Football (Soccer)

Due to the low scoring nature of football (soccer), shots are often used as a proxy to evaluate team and player performances. However, not all shots are created equally and their quality differs significantly depending on the situation. The aim of this study is to objectively quantify the quality of...

Full description

Bibliographic Details
Main Authors: Gabriel Anzer, Pascal Bauer
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-03-01
Series:Frontiers in Sports and Active Living
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fspor.2021.624475/full
_version_ 1818603389894787072
author Gabriel Anzer
Gabriel Anzer
Pascal Bauer
Pascal Bauer
author_facet Gabriel Anzer
Gabriel Anzer
Pascal Bauer
Pascal Bauer
author_sort Gabriel Anzer
collection DOAJ
description Due to the low scoring nature of football (soccer), shots are often used as a proxy to evaluate team and player performances. However, not all shots are created equally and their quality differs significantly depending on the situation. The aim of this study is to objectively quantify the quality of any given shot by introducing a so-called expected goals (xG) model. This model is validated statistically and with professional match analysts. The best performing model uses an extreme gradient boosting algorithm and is based on hand-crafted features from synchronized positional and event data of 105, 627 shots in the German Bundesliga. With a ranked probability score (RPS) of 0.197, it is more accurate than any previously published expected goals model. This approach allows us to assess team and player performances far more accurately than is possible with traditional metrics by focusing on process rather than results.
first_indexed 2024-12-16T13:22:24Z
format Article
id doaj.art-482c55c6b1304283bc854b1c81fe0295
institution Directory Open Access Journal
issn 2624-9367
language English
last_indexed 2024-12-16T13:22:24Z
publishDate 2021-03-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Sports and Active Living
spelling doaj.art-482c55c6b1304283bc854b1c81fe02952022-12-21T22:30:19ZengFrontiers Media S.A.Frontiers in Sports and Active Living2624-93672021-03-01310.3389/fspor.2021.624475624475A Goal Scoring Probability Model for Shots Based on Synchronized Positional and Event Data in Football (Soccer)Gabriel Anzer0Gabriel Anzer1Pascal Bauer2Pascal Bauer3Sportec Solutions AG, Subsidiary of the Deutsche Fußball Liga (DFL), Munich, GermanyInstitute of Sports Science, University of Tübingen, Tübingen, GermanyInstitute of Sports Science, University of Tübingen, Tübingen, GermanyDFB-Akademie, Deutscher Fußball-Bund e.V., Frankfurt am Main, GermanyDue to the low scoring nature of football (soccer), shots are often used as a proxy to evaluate team and player performances. However, not all shots are created equally and their quality differs significantly depending on the situation. The aim of this study is to objectively quantify the quality of any given shot by introducing a so-called expected goals (xG) model. This model is validated statistically and with professional match analysts. The best performing model uses an extreme gradient boosting algorithm and is based on hand-crafted features from synchronized positional and event data of 105, 627 shots in the German Bundesliga. With a ranked probability score (RPS) of 0.197, it is more accurate than any previously published expected goals model. This approach allows us to assess team and player performances far more accurately than is possible with traditional metrics by focusing on process rather than results.https://www.frontiersin.org/articles/10.3389/fspor.2021.624475/fullexpected goalsXGpositional dataevent dataapplied machine learningfootball
spellingShingle Gabriel Anzer
Gabriel Anzer
Pascal Bauer
Pascal Bauer
A Goal Scoring Probability Model for Shots Based on Synchronized Positional and Event Data in Football (Soccer)
Frontiers in Sports and Active Living
expected goals
XG
positional data
event data
applied machine learning
football
title A Goal Scoring Probability Model for Shots Based on Synchronized Positional and Event Data in Football (Soccer)
title_full A Goal Scoring Probability Model for Shots Based on Synchronized Positional and Event Data in Football (Soccer)
title_fullStr A Goal Scoring Probability Model for Shots Based on Synchronized Positional and Event Data in Football (Soccer)
title_full_unstemmed A Goal Scoring Probability Model for Shots Based on Synchronized Positional and Event Data in Football (Soccer)
title_short A Goal Scoring Probability Model for Shots Based on Synchronized Positional and Event Data in Football (Soccer)
title_sort goal scoring probability model for shots based on synchronized positional and event data in football soccer
topic expected goals
XG
positional data
event data
applied machine learning
football
url https://www.frontiersin.org/articles/10.3389/fspor.2021.624475/full
work_keys_str_mv AT gabrielanzer agoalscoringprobabilitymodelforshotsbasedonsynchronizedpositionalandeventdatainfootballsoccer
AT gabrielanzer agoalscoringprobabilitymodelforshotsbasedonsynchronizedpositionalandeventdatainfootballsoccer
AT pascalbauer agoalscoringprobabilitymodelforshotsbasedonsynchronizedpositionalandeventdatainfootballsoccer
AT pascalbauer agoalscoringprobabilitymodelforshotsbasedonsynchronizedpositionalandeventdatainfootballsoccer
AT gabrielanzer goalscoringprobabilitymodelforshotsbasedonsynchronizedpositionalandeventdatainfootballsoccer
AT gabrielanzer goalscoringprobabilitymodelforshotsbasedonsynchronizedpositionalandeventdatainfootballsoccer
AT pascalbauer goalscoringprobabilitymodelforshotsbasedonsynchronizedpositionalandeventdatainfootballsoccer
AT pascalbauer goalscoringprobabilitymodelforshotsbasedonsynchronizedpositionalandeventdatainfootballsoccer