How to Read Dispersion and Skew in Football Models (And Make Smarter Decisions)?
From Averages to Extremes: What Dispersion and Skew Really Tell Us About Football?
If you like data-driven football analysis, consider subscribing to the Football Hacking newsletter (free or premium).
Premium members get full access to my tools, models, and the complete dataset behind the web app across all leagues in my database.
You can also explore the live models here:
👉 https://footballhacking.streamlit.app/
Poisson-based models are everywhere in football analytics. They’re elegant, interpretable, and often very effective—when the competition behaves in a reasonably stable way.
But not all leagues do.
Some competitions produce tight, controlled matches. Others regularly explode into wild scorelines. If you ignore that difference, your model will quietly misprice the tails—exactly where big wins, big losses, and big mistakes live.
That’s why I rely on a small set of diagnostic metrics to understand how close reality is to a Poisson world and how the extremes behave:
d_home: dispersion of home goals
d_away: dispersion of away goals
d_total_goals: dispersion of total goals
d_skellam: dispersion of goal difference
skew: asymmetry of extremes (which side produces more blowouts)
Let’s go through them in practical terms.
Why Dispersion Matters
In a “perfect” Poisson process, variance ≈ mean. So we track dispersion as:
Dispersion = Variance / Mean
≈ 1.0 → The data behaves like Poisson expects.
> 1.0 → Overdispersion: more chaos, fatter tails, more extreme outcomes than the model assumes.
< 1.0 → Underdispersion: results are more concentrated and controlled than the model assumes.
This directly affects:
How often your model misses blowouts and collapses
Whether you overprice or underprice extreme scorelines
How much you should trust handicaps, totals, and margin markets
d_home: Dispersion of Home Goals
d_home tells you how variable home team goals are compared to a Poisson process.
≈ 1.0 → Home goals behave like Poisson.
> 1.0 → Home scoring is more volatile than expected: more games where the home team either stalls or explodes. The model tends to underestimate big home wins.
< 1.0 → Home goals are more controlled and concentrated. The model may overestimate extreme home scores.
Decision tip:
If d_home is high, be cautious with large home handicaps and extreme home scorelines.
d_away: Dispersion of Away Goals
d_away measures the same thing for away team goals.
≈ 1.0 → Away goals fit Poisson well.
> 1.0 → Away scoring is more chaotic than expected. The model may underestimate rare high away scores.
< 1.0 → Away goals are more predictable and concentrated. The model may overestimate extreme away scores.
Decision tip:
If d_away is low, be skeptical of markets that rely heavily on high away scoring (for example, aggressive away team totals or BTTS driven mainly by the away side).
d_total_goals: Dispersion of Total Goals
d_total_goals looks at home + away goals and is especially relevant for Over/Under markets.
≈ 1.0 → Total goals behave like Poisson expects.
> 1.0 → Matches are more chaotic in total scoring. The model may underestimate very high or very low totals.
< 1.0 → Matches are more controlled. The model may overestimate extreme Overs and wild scorelines.
Decision tip:
If d_total_goals is high, be more conservative with high Over lines and extreme totals.
d_skellam: Dispersion of Goal Difference
Goal difference (home − away) follows a Skellam distribution if both teams are Poisson. A practical diagnostic is:
d_skellam = Var(Home − Away) / Mean(Total Goals)
≈ 1.0 → Winning margins look like the model expects.
> 1.0 → Margins are more spread out: more big wins and heavy defeats. The model may underestimate wide handicaps.
< 1.0 → Margins are more concentrated: more narrow wins and draws. The model may overestimate big winning margins.
Decision tip:
If d_skellam is high, be extra cautious with large handicaps and exact margin markets.
Skew: This Is NOT Home Advantage
Here’s the crucial clarification:
Skew does NOT measure home advantage or away advantage.
Home advantage is about average performance.
Skew is about the tails: it tells you to which side the extreme results (blowouts, big margins) tend to go.
In football terms, skew answers this question:
When games break, do they break more often into big home wins or big away wins?
Skew ≈ 0 → Symmetric tails. Big home wins and big away wins happen with similar frequency.
Skew > 0 → Right-skewed. Extreme results are more often big home wins. The model may underestimate very large home victories.
Skew < 0 → Left-skewed. Extreme results are more often big away wins. The model may underestimate very large away victories.
This is about blowouts and heavy margins, not about who is better on average.
You can have:
Strong home advantage but skew ≈ 0 (extremes balanced), or
Modest home advantage but positive skew (when games explode, they usually explode in favor of the home side).
Decision tip:
Positive skew → be careful with large home handicaps being too cheap.
Negative skew → be careful with large away handicaps being too cheap.
How to Use These Metrics Together
Think of them as context and risk filters:
Dispersion near 1.0 → Poisson assumptions are well supported.
Dispersion well above 1.0 → More chaos → be conservative with extremes.
Dispersion below 1.0 → More control → extremes are rarer than the model suggests.
Skew tells you which side tends to produce the blowouts.
They don’t replace your model. They tell you how much you should trust it, and where it’s likely to fail first.
Explore This in the Football Hacking Web App
All these metrics are available in my tools so you can inspect league behavior before trusting any prediction.
👉 Visit the app: https://footballhacking.streamlit.app/
And an important note:
Premium subscribers get access to the full dataset behind the web app for all leagues in my database.
That means you can download and analyze the same data used in the models.
Final Call to Action
If you care about:
Football analytics
Probabilistic modeling
Understanding when a model should (and should not) be trusted
👉 Subscribe to the Football Hacking newsletter (free or premium).
👉 Explore the web app and the data:
The real edge is not just building models—it’s understanding the environments they live in. That’s how you avoid being fooled by averages and start respecting the extremes.


