Ghost-GK v1 β Conditional Density Estimation for Goalkeeper Positioning
Predicts where a league-average goalkeeper would position themselves given the current game state. Uses RFCDE (Random Forest Conditional Density Estimation) over HistGradientBoostingRegressor leaf assignments with weighted 2D kernel density estimation.
Part of the silly-kicks soccer analytics library (GKDV research program, TF-18, Layer 2).
Model Description
Standard goalkeeper evaluation metrics (xGOT, save percentage, goals prevented) measure what happens after a shot is taken. Ghost-GK addresses the upstream question: given the current game state, where should the goalkeeper be standing?
The model learns a league-average positional density from thousands of match frames across multiple tracking data providers. For any given frame, it outputs a full 2D probability distribution over the goal-relative region, not just a point estimate. This enables downstream metrics like the GK Deterrent Value (GKDV) β comparing the actual GK position against the ghost position to quantify positioning-as-deterrent.
Key properties:
- Density estimation, not regression: Outputs a 60Γ64 probability grid (3,840 cells at 0.5m resolution), not a single (x, y) point. Captures multimodal positioning (e.g., split between near-post and central when the ball is wide).
- No pickle: Serialized as npz (NumPy arrays) + JSON (metadata) + SHA-256 integrity sidecar. No pickle anywhere in the load/save path.
- Vectorized inference: Tree traversal uses NumPy array operations (no sklearn at inference time). Batch prediction of 1,000 frames completes in under 1 second.
- Two variants:
"default"(approx. 12 MB, 36k training frames) ships bundled in the wheel;"full"(approx. 170 MB, 887k frames) downloads from this Hub repo on first use.
Architecture
The model implements RFCDE (Pospisil & Lee 2018) adapted for goalkeeper positioning:
- Feature extraction: 26 goal-relative features per frame (ball state, defensive geometry, game context).
phaseis trained numerically (not categorical) so the pickle-free numeric tree traversal matches sklearn exactly (4.14.0; ADR-016). - Leaf assignment:
HistGradientBoostingRegressor(500 trees, max depth 8) trained on GK x-coordinate; leaf assignments partition the feature space - Co-occurrence weighting: Training frames sharing leaf assignments with the query frame receive higher weight (Dutta et al. 2024 NFL Ghosts approach)
- 2D KDE: Weighted Gaussian KDE over (x, y) positions of weighted training frames produces the density surface (
mode,mean,density_spread) - Served point estimate: a second
HistGradientBoostingRegressoris trained on GK y-coordinate;ghost_gk_x/yserve the exact boosted mean of both ensembles, reconstructed pickle-free asbaseline + Ξ£_trees leaf_value(no sklearn at inference). 4.14.0; ADR-016.
Features (26)
| Category | Features |
|---|---|
| Ball state | ball_x, ball_y, ball_vx, ball_vy, ball_distance_to_goal, ball_to_goal_angle, ball_speed |
| Defensive geometry | defensive_line_x, defensive_line_depth, defensive_line_width, defensive_line_speed, defenders_behind_ball, deepest_defender_x, defending_team_compactness, defending_centroid_vx |
| Attacking geometry | attackers_in_box, nearest_attacker_to_goal_x, attacker_centroid_x, attacker_centroid_y, ball_to_nearest_attacker_dist |
| Game context | phase, team_in_possession, score_diff, time_seconds, period_id, ball_in_own_half |
All coordinates are goal-relative: the defending goal is at x=0, pitch center at y=34.
Hyperparameters
| Parameter | Value |
|---|---|
| Algorithm | HistGradientBoostingRegressor |
| Number of trees | 500 |
| Max depth | 8 |
| Grid resolution | 0.5m (60Γ64 cells) |
| Grid coverage | x: [0, 30]m from goal line, y: [18, 50]m across pitch |
Variants
| Variant | Training frames | File size | Source |
|---|---|---|---|
default |
36,000 | 12 MB | Bundled in pip install silly-kicks |
full |
887,000 | 170 MB | Downloaded from this HF repo via pip install silly-kicks[ghost-gk] |
The default variant provides nearly identical point-estimate accuracy (mode x/y) with faster density estimation. The full variant produces smoother, more detailed density surfaces β recommended for research applications where the full density shape matters.
Training Data
Trained on licensed tracking data from professional football matches:
| Provider | Competitions | Notes |
|---|---|---|
| Sportec (DFL) | Bundesliga | Native GK identification |
| SkillCorner | Multiple leagues | Derived GK identification (ADR-007) |
| Gradient Sports | FIFA World Cup 2022 | Owner-tier source β only the trained model weights are distributed here; the underlying raw tracking data is not redistributed |
The full variant is trained on 81 matches / 887k frames across all three providers above; the default variant is a lighter 36k-frame subsample. Only the learned model parameters (tree structure, leaf-aggregated GK positions, KDE weights) are published β no raw provider tracking data is redistributed.
Training frames are filtered to remove sweeper-rush events (GK outside penalty area during active defensive actions) to ensure the ghost represents normal positioning behavior.
Label domain: GK (x, y) position in goal-relative coordinates, filtered to the grid region [0, 30] Γ [18, 50].
Usage
import silly_kicks.tracking as tracking
# Default variant (bundled, works offline)
densities = tracking.compute_ghost_gk(frames, model="default")
# Full variant (downloads from HF Hub on first use)
densities = tracking.compute_ghost_gk(frames, model="full")
# Action-coupled aggregator for VAEP integration
actions = tracking.add_ghost_gk(actions, frames, model="full")
# Direct model loading
model = tracking.GhostGkModel.from_variant("full")
density = model.predict_density(feature_vector)
print(f"Mode: ({density.mode_x:.1f}, {density.mode_y:.1f})")
print(f"Spread: {density.spread:.2f}")
Output
Each prediction returns a GhostGkDensity frozen dataclass:
| Field | Type | Description |
|---|---|---|
mode_x |
float | Joint 2D mode x (argmax), goal-relative meters |
mode_y |
float | Joint 2D mode y (argmax), goal-relative meters |
mean_x |
float | Density-weighted (grid) mean x |
mean_y |
float | Density-weighted (grid) mean y |
spread |
float | Effective area (entropy-based density dispersion measure) |
probabilities |
ndarray (60, 64) | Full density grid |
grid_x |
ndarray (60,) | X-axis cell centers |
grid_y |
ndarray (64,) | Y-axis cell centers |
The served point estimate (ghost_gk_x/y, model.predict()) is the exact boosted HGBR
predict_mean (below), reconstructed pickle-free β it is not a field of GhostGkDensity.
Served point estimate (v4.14.0)
ghost_gk_x/y and model.predict() serve the exact sklearn HistGradientBoostingRegressor boosted
mean β the same estimator the old card's β1.1 m number measured, but now reconstructed pickle-free
(baseline + Ξ£_trees leaf_value) so it survives load() and is actually served. This closes a
pre-existing integrity gap: the card reported β1.1 m for an estimator that save()/load() discarded,
while production served the KDE mode (β4.65 m):
| Estimator | Held-out euclidean MAE | Served? |
|---|---|---|
old card number (predict_mean, sklearn, phase-categorical) |
β1.1 m | never served (unavailable after load()) |
| KDE mode (β€ v4.12) | β4.65 m | served through 4.12 |
| boosted mean (4.14.0, reconstructed pickle-free) | 1.07 m (5-fold aggregate) | served now |
The 4.14.0 number is re-measured at re-fit on the same held-out split as the mode (not copied from the
old β1.1 m card, which was a different, phase-categorical model). An intermediate design that served
the leaf-weighted conditional mean (no re-fit) was empirically rejected β it measured β7.0 m, worse
than the mode, because the conditional density is broad + multimodal. The boosted mean is a structurally
stronger estimator and is the only candidate that beats the mode. The mode remains available via
predict_density(...).mode_x/mode_y. See ADR-016.
Weights re-fit + re-published (4.14.0). This is an artifact-format change (the npz now carries the
gk_y tree ensemble + baselines for the reconstruction; metadata.version = 1.2.0,
serve_estimator = "boosted_mean"), and fit() now trains phase numerically (closing a latent KDE
categorical-routing capability gap). Both the bundled default and this Hub full model are re-fit;
old-format artifacts fail closed on load with a clear "re-fit required" error.
Breaking: the emitted spread column is renamed
ghost_gk_spreadβghost_gk_density_spread(it is the conditional-density dispersion, not the served point's standard error). Lakehouse consumers must rename on consume + re-materializeghost_gk_*.
Serialization Format
model_dir/
rfcde_weights.npz # NumPy arrays: leaf assignments, training positions, tree structure
metadata.json # Feature names, grid spec, hyperparameters, version
SHA256SUMS # Integrity checksums (CRLF-normalized for cross-platform safety)
No pickle is used anywhere in the serialization or deserialization path.
Coordinate System
Input frames must be in LTR-normalized convention (home team attacks right in all periods β the standard silly-kicks tracking output after play_left_to_right normalization).
Features are extracted in goal-relative coordinates:
- Origin: defending goal center (x=0, y=34)
- The defending goal is inferred per (game_id, period_id, team_id) from mean GK x position
Limitations
- League-average ghost: The model predicts where an average goalkeeper would stand, not where a specific goalkeeper would stand. Stylistic differences (sweeper-keeper vs. line-keeper) are averaged out.
- No shot-stopping ability: Ghost-GK models positioning, not reactions. It does not predict save probability or diving reach.
- Tracking data quality: Predictions inherit noise from the underlying tracking system. SkillCorner broadcast-derived coordinates are noisier than optical systems (Sportec DFL).
- LTR normalization required: Input frames must be LTR-normalized. Feeding raw provider coordinates produces incorrect goal-relative features.
- Static density: Each frame produces an independent density estimate. Temporal smoothing is not built into the model (apply externally if needed).
References
@inproceedings{le2017ghosting,
title={Data-Driven Ghosting Using Deep Imitation Learning},
author={Le, Hoang M. and Yue, Yisong and Carr, Peter and Lucey, Patrick},
booktitle={MIT Sloan Sports Analytics Conference},
year={2017}
}
@article{dutta2024nflghosts,
title={NFL Ghosts: A framework for evaluating defender positioning
with conditional density estimation},
author={Dutta, Rishav and Yurko, Ronald and Ventura, Samuel},
journal={arXiv preprint arXiv:2406.17220},
year={2024}
}
@article{pospisil2018rfcde,
title={RFCDE: Random Forests for Conditional Density Estimation},
author={Pospisil, Taylor and Lee, Ann B.},
journal={arXiv preprint arXiv:1804.05753},
year={2018}
}
@software{nielsen2026ghostgk,
title={Ghost-GK: Conditional Density Estimation for Goalkeeper Positioning},
author={Nielsen, Karsten Skyt},
year={2026},
url={https://github.com/karsten-s-nielsen/silly-kicks}
}
Model Files
| File | Size | Description |
|---|---|---|
rfcde_weights.npz |
170 MB | gk_x + gk_y tree structure + baselines, leaf assignments, training GK positions |
metadata.json |
1 KB | Feature names, grid specification, hyperparameters, serve_estimator, version |
SHA256SUMS |
164 B | Integrity checksums |
More Information
- License: MIT (same as silly-kicks)
- Library: silly-kicks (v3.24.0+)
- Documentation: silly-kicks GitHub
- Research program: GKDV (GK Deterrent Value) β TF-15 through TF-19