Vienna University of Economics and Business
May 24, 2024
Jan-Ole Koslik
PhD Student
Universität Bielefeld
Rouven Michels
PhD Student
Universität Bielefeld
Marius Ötting
Postdoc
Universität Bielefeld
How can we value a tackle?
How can we value a tackle?
Measure the yards saved by a tackle:
Predict the end-of-play yard line (EOPY) when removing the tackler from the data.
Compare to real EOPY (or predicted on actual data with tackler).
Problem: Value of yards saved depends on the game state.
Example 1: 2 yards saved close to the defenders end zone vs. 2 yards saved close to the opponents end zone.
Example 2: 2 yards saved on 4th and 10 vs. 2 yards saved on 4th and 2.
How can we value a tackle?
Measure the prevented expected points (PEP) by a tackle:
Mapping from EOPY to expected points.
Point prediction of mean EOPY lacks uncertainty quantification.
How can we value a tackle?
Measure the prevented expected points (PEP) by a tackle:
Mapping from EOPY to expected points.
Point prediction of mean EOPY lacks uncertainty quantification.
Full conditional density estimate necessary to calculate expected points.
Estimate the full conditional density of the EOPY.
Estimate an expected points model as mapping of the EOPY.
Estimate a tackle value by comparing the real outcome with the hypothetical outcome:
Repeatedly grow different regression trees.
Trees are build to decrease variance (decrease correlation between single trees) via randomization:
Usually: predictions obtained by averaging individual tree predictions.
Train random forest on preprocessed data:
Train two models for evaluation of tackles:
Model evaluation:
Expected points: Given the game situation, how many points do we expect the team to score?
\[ {\rm EP} = \mathbb{E}[Y|X] = \sum_{y} y \cdot \mathbb{P}(Y = y|X),\] \(Y \dots\) scoring outcomes, \(X \dots\) covariates describing the game state.
Estimate the probabilities \(\mathbb{P}(Y = y|X)\) for each scoring outcome.
Covariates \(X\):
XGBoost model for estimation of probabilities with careful tuning and evaluation.
At the time of tackle: estimate the mean expected points from the EOPY predictions.
\[\mathbb{E}(g(Y) \mid x) = \int g(y) \: \hat{f} (y \mid x) \: dy,\]
\(g(y)\): Mapping from the yard line \(y\) to expected points (EP model).
\(\hat{f} (y \mid x)\): conditional density estimate from the random forest.
Calculation could be done two ways:
\[\frac{1}{1000} \sum_{i=1}^{1000} g(\hat{y}_i)\]
\(\mathbb{E}(g(Y) \mid x)\) allows to analyze predicted yards gained on a EP scale.
Tackle evaluation:
Still interested in evaluating a hypothetical scenario \(\Rightarrow\) evaluate mean expected points for the tackle, when removing the tackler from play \(\mathbb{E}(g(Y) \mid x_{removed})\):
We know the true outcome \(y_0\) (the true EOPY) \(\Rightarrow\) compute EP \(g(y_0)\).
Prevented expected points:
\[\text{PEP} = \text{E}(g(Y) \mid x_{removed}) - g(y_0).\]
How can we evaluate a player’s tackling ability?
Sum up the PEP values of each tackle a player makes.
How can we evaluate a player’s tackling ability?
Sum up the PEP values of each tackle a player makes.
Average the PEP values of players (if tackle reasonably often).
How can we evaluate a player’s tackling ability?
Model the impact of players on PEP values for specific tackle.
View full results table and more: click here
PEP: quantifying tackle values on a relevant scale to the game.
Using conditional density estimates to propagate uncertainty from yards gained to EP scale.
Extensions:
Ball Carrier? Evaluate ball carriers ability to maintain EPs despite being tackled.
Missed tackles and assisted tackles? Measuring the impact of players close by on PEP.
Thank you for your attention!