Ghost-GK v1 — Conditional Density Estimation for Goalkeeper Positioning

Predicts where a league-average goalkeeper would position themselves given the current game state. Uses RFCDE (Random Forest Conditional Density Estimation) over HistGradientBoostingRegressor leaf assignments with weighted 2D kernel density estimation.

Part of the silly-kicks soccer analytics library (GKDV research program, TF-18, Layer 2).

Model Description

Standard goalkeeper evaluation metrics (xGOT, save percentage, goals prevented) measure what happens after a shot is taken. Ghost-GK addresses the upstream question: given the current game state, where should the goalkeeper be standing?

The model learns a league-average positional density from thousands of match frames across multiple tracking data providers. For any given frame, it outputs a full 2D probability distribution over the goal-relative region, not just a point estimate. This enables downstream metrics like the GK Deterrent Value (GKDV) — comparing the actual GK position against the ghost position to quantify positioning-as-deterrent.

Key properties:

Density estimation, not regression: Outputs a 60×64 probability grid (3,840 cells at 0.5m resolution), not a single (x, y) point. Captures multimodal positioning (e.g., split between near-post and central when the ball is wide).
No pickle: Serialized as npz (NumPy arrays) + JSON (metadata) + SHA-256 integrity sidecar. No pickle anywhere in the load/save path.
Vectorized inference: Tree traversal uses NumPy array operations (no sklearn at inference time). Batch prediction of 1,000 frames completes in under 1 second.
Two variants: "default" (approx. 12 MB, 36k training frames) ships bundled in the wheel; "full" (approx. 170 MB, 887k frames) downloads from this Hub repo on first use.

Architecture

The model implements RFCDE (Pospisil & Lee 2018) adapted for goalkeeper positioning:

Feature extraction: 26 goal-relative features per frame (ball state, defensive geometry, game context). phase is trained numerically (not categorical) so the pickle-free numeric tree traversal matches sklearn exactly (4.14.0; ADR-016).
Leaf assignment: HistGradientBoostingRegressor (500 trees, max depth 8) trained on GK x-coordinate; leaf assignments partition the feature space
Co-occurrence weighting: Training frames sharing leaf assignments with the query frame receive higher weight (Dutta et al. 2024 NFL Ghosts approach)
2D KDE: Weighted Gaussian KDE over (x, y) positions of weighted training frames produces the density surface (mode, mean, density_spread)
Served point estimate: a second HistGradientBoostingRegressor is trained on GK y-coordinate; ghost_gk_x/y serve the exact boosted mean of both ensembles, reconstructed pickle-free as baseline + Σ_trees leaf_value (no sklearn at inference). 4.14.0; ADR-016.

Features (26)

Category	Features
Ball state	`ball_x`, `ball_y`, `ball_vx`, `ball_vy`, `ball_distance_to_goal`, `ball_to_goal_angle`, `ball_speed`
Defensive geometry	`defensive_line_x`, `defensive_line_depth`, `defensive_line_width`, `defensive_line_speed`, `defenders_behind_ball`, `deepest_defender_x`, `defending_team_compactness`, `defending_centroid_vx`
Attacking geometry	`attackers_in_box`, `nearest_attacker_to_goal_x`, `attacker_centroid_x`, `attacker_centroid_y`, `ball_to_nearest_attacker_dist`
Game context	`phase`, `team_in_possession`, `score_diff`, `time_seconds`, `period_id`, `ball_in_own_half`

All coordinates are goal-relative: the defending goal is at x=0, pitch center at y=34.

Hyperparameters

Parameter	Value
Algorithm	`HistGradientBoostingRegressor`
Number of trees	500
Max depth	8
Grid resolution	0.5m (60×64 cells)
Grid coverage	x: [0, 30]m from goal line, y: [18, 50]m across pitch

Variants

Variant	Training frames	File size	Source
`default`	36,000	12 MB	Bundled in `pip install silly-kicks`
`full`	887,000	170 MB	Downloaded from this HF repo via `pip install silly-kicks[ghost-gk]`

The default variant provides nearly identical point-estimate accuracy (mode x/y) with faster density estimation. The full variant produces smoother, more detailed density surfaces — recommended for research applications where the full density shape matters.

Training Data

Trained on licensed tracking data from professional football matches:

Provider	Competitions	Notes
Sportec (DFL)	Bundesliga	Native GK identification
SkillCorner	Multiple leagues	Derived GK identification (ADR-007)
Gradient Sports	FIFA World Cup 2022	Owner-tier source — only the trained model weights are distributed here; the underlying raw tracking data is not redistributed

The full variant is trained on 81 matches / 887k frames across all three providers above; the default variant is a lighter 36k-frame subsample. Only the learned model parameters (tree structure, leaf-aggregated GK positions, KDE weights) are published — no raw provider tracking data is redistributed.

Training frames are filtered to remove sweeper-rush events (GK outside penalty area during active defensive actions) to ensure the ghost represents normal positioning behavior.

Label domain: GK (x, y) position in goal-relative coordinates, filtered to the grid region [0, 30] × [18, 50].

Usage

import silly_kicks.tracking as tracking

# Default variant (bundled, works offline)
densities = tracking.compute_ghost_gk(frames, model="default")

# Full variant (downloads from HF Hub on first use)
densities = tracking.compute_ghost_gk(frames, model="full")

# Action-coupled aggregator for VAEP integration
actions = tracking.add_ghost_gk(actions, frames, model="full")

# Direct model loading
model = tracking.GhostGkModel.from_variant("full")
density = model.predict_density(feature_vector)
print(f"Mode: ({density.mode_x:.1f}, {density.mode_y:.1f})")
print(f"Spread: {density.spread:.2f}")

Output

Each prediction returns a GhostGkDensity frozen dataclass:

Field	Type	Description
`mode_x`	float	Joint 2D mode x (argmax), goal-relative meters
`mode_y`	float	Joint 2D mode y (argmax), goal-relative meters
`mean_x`	float	Density-weighted (grid) mean x
`mean_y`	float	Density-weighted (grid) mean y
`spread`	float	Effective area (entropy-based density dispersion measure)
`probabilities`	ndarray (60, 64)	Full density grid
`grid_x`	ndarray (60,)	X-axis cell centers
`grid_y`	ndarray (64,)	Y-axis cell centers

The served point estimate (ghost_gk_x/y, model.predict()) is the exact boosted HGBR predict_mean (below), reconstructed pickle-free — it is not a field of GhostGkDensity.

Served point estimate (v4.14.0)

ghost_gk_x/y and model.predict() serve the exact sklearn HistGradientBoostingRegressor boosted mean — the same estimator the old card's ≈1.1 m number measured, but now reconstructed pickle-free (baseline + Σ_trees leaf_value) so it survives load() and is actually served. This closes a pre-existing integrity gap: the card reported ≈1.1 m for an estimator that save()/load() discarded, while production served the KDE mode (≈4.65 m):

Estimator	Held-out euclidean MAE	Served?
old card number (`predict_mean`, sklearn, phase-categorical)	≈1.1 m	never served (unavailable after `load()`)
KDE mode (≤ v4.12)	≈4.65 m	served through 4.12
boosted mean (4.14.0, reconstructed pickle-free)	1.07 m (5-fold aggregate)	served now

The 4.14.0 number is re-measured at re-fit on the same held-out split as the mode (not copied from the old ≈1.1 m card, which was a different, phase-categorical model). An intermediate design that served the leaf-weighted conditional mean (no re-fit) was empirically rejected — it measured ≈7.0 m, worse than the mode, because the conditional density is broad + multimodal. The boosted mean is a structurally stronger estimator and is the only candidate that beats the mode. The mode remains available via predict_density(...).mode_x/mode_y. See ADR-016.

Weights re-fit + re-published (4.14.0). This is an artifact-format change (the npz now carries the gk_y tree ensemble + baselines for the reconstruction; metadata.version = 1.2.0, serve_estimator = "boosted_mean"), and fit() now trains phase numerically (closing a latent KDE categorical-routing capability gap). Both the bundled default and this Hub full model are re-fit; old-format artifacts fail closed on load with a clear "re-fit required" error.

Breaking: the emitted spread column is renamed ghost_gk_spread → ghost_gk_density_spread (it is the conditional-density dispersion, not the served point's standard error). Lakehouse consumers must rename on consume + re-materialize ghost_gk_*.

Serialization Format

model_dir/
  rfcde_weights.npz    # NumPy arrays: leaf assignments, training positions, tree structure
  metadata.json        # Feature names, grid spec, hyperparameters, version
  SHA256SUMS           # Integrity checksums (CRLF-normalized for cross-platform safety)

No pickle is used anywhere in the serialization or deserialization path.

Coordinate System

Input frames must be in LTR-normalized convention (home team attacks right in all periods — the standard silly-kicks tracking output after play_left_to_right normalization).

Features are extracted in goal-relative coordinates:

Origin: defending goal center (x=0, y=34)
The defending goal is inferred per (game_id, period_id, team_id) from mean GK x position

Limitations

League-average ghost: The model predicts where an average goalkeeper would stand, not where a specific goalkeeper would stand. Stylistic differences (sweeper-keeper vs. line-keeper) are averaged out.
No shot-stopping ability: Ghost-GK models positioning, not reactions. It does not predict save probability or diving reach.
Tracking data quality: Predictions inherit noise from the underlying tracking system. SkillCorner broadcast-derived coordinates are noisier than optical systems (Sportec DFL).
LTR normalization required: Input frames must be LTR-normalized. Feeding raw provider coordinates produces incorrect goal-relative features.
Static density: Each frame produces an independent density estimate. Temporal smoothing is not built into the model (apply externally if needed).

References

@inproceedings{le2017ghosting,
  title={Data-Driven Ghosting Using Deep Imitation Learning},
  author={Le, Hoang M. and Yue, Yisong and Carr, Peter and Lucey, Patrick},
  booktitle={MIT Sloan Sports Analytics Conference},
  year={2017}
}

@article{dutta2024nflghosts,
  title={NFL Ghosts: A framework for evaluating defender positioning
         with conditional density estimation},
  author={Dutta, Rishav and Yurko, Ronald and Ventura, Samuel},
  journal={arXiv preprint arXiv:2406.17220},
  year={2024}
}

@article{pospisil2018rfcde,
  title={RFCDE: Random Forests for Conditional Density Estimation},
  author={Pospisil, Taylor and Lee, Ann B.},
  journal={arXiv preprint arXiv:1804.05753},
  year={2018}
}

@software{nielsen2026ghostgk,
  title={Ghost-GK: Conditional Density Estimation for Goalkeeper Positioning},
  author={Nielsen, Karsten Skyt},
  year={2026},
  url={https://github.com/karsten-s-nielsen/silly-kicks}
}

Model Files

File	Size	Description
`rfcde_weights.npz`	170 MB	gk_x + gk_y tree structure + baselines, leaf assignments, training GK positions
`metadata.json`	1 KB	Feature names, grid specification, hyperparameters, `serve_estimator`, version
`SHA256SUMS`	164 B	Integrity checksums

More Information

License: MIT (same as silly-kicks)
Library: silly-kicks (v3.24.0+)
Documentation: silly-kicks GitHub
Research program: GKDV (GK Deterrent Value) — TF-15 through TF-19

Downloads last month: -; Downloads are not tracked for this model. How to track

Papers for silly-kicks/ghost-gk-v1

NFL Ghosts: A framework for evaluating defender positioning with conditional density estimation

Paper • 2406.17220 • Published Jun 25, 2024

RFCDE: Random Forests for Conditional Density Estimation

Paper • 1804.05753 • Published Apr 16, 2018