xG models for football bets: step-by-step over under tips football guide

Table of Contents

Why expected goals (xG) should change the way you bet over/under

You probably look at recent scorelines, injuries, and a bookmaker’s total when sizing up an over/under market. xG (expected goals) gives you a different, often more reliable lens. Instead of simply counting goals scored, xG estimates the quality of chances teams create and concede. That means you can spot games where the final score was misleading — a 0-0 might hide multiple near-misses, and a 4-3 thriller might have relied on a fluke or two.

For betting over/under, that distinction is crucial. Totals markets are driven by chance conversion as much as by chance creation. When you use xG, you separate the probability of goals occurring from random variance. You start predicting whether a match is likely to produce more or fewer goals than the market expects, not just whether a team will win.

What you need to understand about xG components before modeling totals

Before you build or trust an xG model for over/under bets, get comfortable with the following core elements. Each affects how reliably xG translates into predicted goal totals.

1. Shot quality and quantity

Shot quality: Every attempt is assigned a probability of scoring based on location, body part, assist type, and defensive pressure. Higher-quality shots mean higher xG.
Shot quantity: Two teams that create many low-quality shots can still outperform a single high-quality chance; both volume and quality matter when projecting goals.

2. Defensive context and expected goals against (xGA)

xGA measures the quality of chances a team concedes. Teams may appear solid by scoreline but allow dangerous chances regularly — a red flag for over/under bets.
Context matters: a defensive unit’s xGA can rise or fall with injuries, tactics, or opponent strength.

3. Game state and tactical influence

Game states (leading, drawing, trailing) change how teams play. A side that often concedes late goals after taking the lead will influence the distribution of goals across 90 minutes. When modeling totals, you must account for how likely each team is to be attacking or defending at different periods.

4. Sample size and league-normalization

Small samples (a few matches) are noisy. Use several dozen matches for team-level xG rates if possible.
Different leagues have different scoring environments. Normalize xG to league averages before comparing teams from different competitions.

With these components in mind, you’ll be better positioned to interpret raw xG numbers and avoid common pitfalls (overreacting to one high-xG match, ignoring tactical shifts, or treating xG as a final score prediction rather than a probability). In the next section, you’ll walk through building a simple xG-based model step by step, including data sources, basic calculations, and how to convert xG into over/under probabilities.

Gathering and cleaning xG data: practical sources and preprocessing steps

Start by collecting reliable event-level xG data — not just final match xG totals. Providers such as StatsBomb, Opta, Wyscout, and open sources (FBref/StatsBomb public datasets) give shot-level xG with context (location, assist type, body part). For a simple totals model you need: shot xG per team per match, match minute for each shot (if you’ll do game-state adjustments), and basic match metadata (home/away, competition, date).

Preprocessing checklist:

Filter by competition and season. Combine seasons only after league-normalizing (see below).
Remove outliers and garbage data (mis-tagged shots, abandoned games). Check for missing shot coordinates or null xG values.
Aggregate to team-level rates: compute attack xG per 90 (xG for) and defense xG per 90 (xG against) over your chosen sample window.
Weight matches by recency. A simple exponential decay (e.g., weight = 0.95^weeks_ago) balances recent form with longer-term ability.
Normalize across leagues: calculate league-average xG per 90 and scale team rates so they’re comparable (team_rate_norm = team_rate_raw * league_avg_overall / league_avg_raw).

Keep the sample-size problem front of mind: allow at least 10–20 matches for reasonable team estimates; more if possible. For promoted/relegated teams or managerial changes, use a heavier recent weighting or supplement with roster-adjusted priors.

From team xG to match totals: modeling and converting to over/under probabilities

With cleaned, normalized attack and defense rates, build expected goals for each team in a fixture. A straightforward baseline:

Expected goals for Home = Home_attack_xG90 Away_defense_xGA90 / League_avg_xG90 Home_minutes_factor
Expected goals for Away = Away_attack_xG90 * Home_defense_xGA90 / League_avg_xG90

Include a home advantage multiplier (often ~1.05–1.15 depending on league). If you weighted by minutes or game states, adjust the minutes_factor accordingly.

Next, choose how to turn two expected-goal numbers into a distribution of total goals. Options ranked by simplicity and realism:

Independent Poisson model (simple): treat each team’s goals as Poisson(λ). Sum of two independent Poissons gives the total-goals distribution directly. This is fast and often surprisingly effective for totals.
Bivariate Poisson (better): accounts for correlation between teams’ scoring (e.g., open games increasing both sides’ chances). It requires estimating a covariance parameter from historical data.
Monte Carlo simulation (flexible): sample goals from distributions that can include overdispersion (negative binomial) and incorporate game-state dynamics or late-match scoring tendencies. Simulate 10,000+ matches to estimate probabilities for >2.5, >1.5, etc.

Calibration and adjustments:

Calibrate your model on a holdout set: compare predicted probabilities (e.g., model says 60% for over 2.5) to actual frequencies and adjust. Apply simple scaling to correct systematic over- or under-estimation.
Account for variance: Poisson may understate variance in high-scoring leagues. If residuals show overdispersion, switch to a negative-binomial or inflate variance when simulating.
Game-state modifiers: if you have minute-by-minute xG, simulate tactical shifts — teams trailing late may push and raise combined xG. Even a crude rule (increase trailing team’s λ by 10–20% after 60 minutes) can improve accuracy for totals markets.

Finally, convert model probabilities to implied odds and compare with market lines. A profitable edge exists only when your model’s probability significantly exceeds the market-implied probability after accounting for bookmaker margin and model uncertainty. In the next part we’ll discuss comparing model outputs to bookmakers, sizing stakes, and live in-play adjustments.

Putting xG models into practice

Building an xG-based over/under model is only half the job — the other half is disciplined implementation. Treat your model as a tool for probabilistic thinking rather than a prediction oracle. Backtest thoroughly, quantify uncertainty, and be conservative about the edges you claim to find. Keep a public or private record of bets and outcomes to avoid hindsight bias and to measure true return on investment.

Backtest on holdout seasons and track calibration: compare predicted probabilities to actual frequencies and correct systematic bias.
Manage variance: size stakes according to the confidence in each edge, use a Kelly-fraction or flat-percentage staking plan, and protect bankroll against long losing runs.
Monitor market dynamics: odds move for a reason — lineup news, weather, or sharp money. Cross-check model edges against those signals before committing.
Iterate and automate: automate data pipelines (for example, using publicly available sources such as StatsBomb Open Data), but review model adjustments manually to avoid overfitting.
Respect limits and liquidity: smaller markets or lower leagues can offer mispricings but may also have wide margins and low liquidity — factor that into stake sizing.

Above all, be patient. Edges from xG come from consistent, small advantages and disciplined execution. Continue refining your model, treating it as an evolving system that benefits from better data, realistic variance assumptions, and careful market comparison.

Frequently Asked Questions

How many matches do I need for reliable team xG estimates?

Aim for at least 10–20 matches as a minimum; more is better. Use exponential recency weighting to combine recent form with longer-term ability, and increase the sample when possible to reduce noise. Promotional teams or teams with major roster changes deserve extra adjustment or priors.

Can I use xG models for live (in-play) over/under betting?

Yes — but live betting requires minute-level data and a model that accounts for game state shifts (substitutions, red cards, and tactical changes). Simple live rules (e.g., increase trailing team’s λ after 60 minutes) help, but accurate in-play edges demand fast data, low-latency odds monitoring, and strict money management.

Should I use Poisson or a more complex distribution for totals?

Poisson is a good starting point for speed and simplicity, but check for overdispersion. If your residuals show greater variance than Poisson predicts, consider negative-binomial models, bivariate Poisson to model correlation, or Monte Carlo simulations to capture tactical and game-state variability.