Abstract

Passes are by far football’s (soccer) most frequent event, yet surprisingly little meaningful research has been devoted to quantify them. With the increase in availability of so-called positional data, describing the positioning of players and ball at every moment of the game, our work aims to determine the difficulty of every pass by calculating its success probability based on its surrounding circumstances. As most experts will agree, not all passes are of equal difficulty, however, most traditional metrics count them as such. With our work we can quantify how well players can execute passes, assess their risk profile, and even compute completion probabilities for hypothetical passes by combining physical and machine learning models. Our model uses the first 0.4 seconds of a ball trajectory and the movement vectors of all players to predict the intended target of a pass with an accuracy of 93.0% for successful and 72.0% for unsuccessful passes much higher than any previously published work. Our extreme gradient boosting model can then quantify the likelihood of a successful pass completion towards the identified target with an area under the curve (AUC) of 93.4%. Finally, we discuss several potential applications, like player scouting or evaluating pass decisions.

Highlights

  • Passes are a crucial part of modern football matches

  • Pass completion rates are regularly used as performance indicators on team and player levels in literature (Bradley et al 2013; Król et al 2017) and in the daily business of professional football teams

  • Pettersen et al (2014) present a publicly available set of positional data, and open source event data can be found in Pappalardo et al (2019)

Read more

Summary

Introduction

Passes are a crucial part of modern football (soccer) matches. traditionally player’s passing performance is quantified using a binary pass completion metric. In order to retrieve the missing information regarding the intended receiver of a pass (at the moment the pass was played), they modelled both ball and player trajectory based on physical simulations first This allows them to calculate an xPass value, as the predicted probability of a pass being completed, at the moment when the pass is played. Overall the literature is lacking a thoroughly described method of synchronizing pass events with tracking data, a highly accurate intended receiver estimation and a properly (manually) evaluated xPass model Our work fills this gap, while keeping the individual modules completely separated and introduces novel concepts, like blocking probabilities. By combining and slightly improving previor work, we exceed the accuracy of all previously presented results for the prediction of the pass receiver as well as the classification of played passes being successful or not

Data and definitions
Estimating the target
Modelling the ball trajectory
Movement model
Target estimation
Physics-based passing features
Pass probability estimation
22 Height
Gradient Boosting
Manual validation
Discussion
A The synchronization of pass events
B Details on XGBoost expected pass and blocking model
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call