Chapter 4 · Probability, calibration, and edge

Everything polyAether does rests on three plain ideas: what a probability means, what it means to be honest about your probabilities, and how you turn an honest probability into money. This chapter builds all three from zero.

01 — What a probability really means

A probability is just a number between 0 and 1 (or, if you prefer, between 0% and 100%) that measures how likely something is. Zero means "impossible." One means "certain." Everything interesting lives in between.

The cleanest way to feel what a probability is comes from a die. Roll a fair six-sided die. The probability of rolling a 4 is 1/6, or about 17%. What does that number actually promise? Not that your next roll will be a 4 — a single roll is a single roll. It promises something about the long run: if you roll the die ten thousand times, roughly one-sixth of those rolls will come up 4. The probability is the fraction you'd see if you could repeat the situation over and over.

Weather works the same way, even though you can't literally re-roll a day. When a forecast says "70% chance of rain tomorrow," it is making a long-run claim about a whole class of days: across all the days that looked like this one — same season, same pressure patterns, same everything the forecast can see — it rained on about 70% of them. Tomorrow is one draw from that pile. It will either rain or it won't; the 70% describes the pile, not the single day.

Key idea

A probability is a long-run frequency. "70%" doesn't predict tomorrow — it says that among all the days like this one, about 7 in 10 turn out that way. A single outcome can never prove a probability right or wrong; only many outcomes can.

Why one outcome tells you almost nothing

This is the trap that fools almost everyone. Suppose someone says "80% chance of sun" and it rains. Were they wrong? No — not from that one day. An 80% forecast is supposed to be wrong about 1 day in 5. If it were never wrong, it wasn't really 80%, it was more like 100%. Rare things happening at their promised rate is exactly what a good forecast looks like. To judge a probability, you have to zoom out and count many days.

02 — Calibration: the test of an honest probability

So if a single day can't grade a forecaster, what can? The answer is calibration. A forecaster is well-calibrated when their stated probabilities match reality in the long run: of all the times they say "70%," the thing actually happens about 70% of the time. Of all the times they say "30%," it happens about 30% of the time. And so on, up and down the whole range.

Here's the honest way to picture it. Collect every day a forecaster said "70% chance of rain." Maybe that's 200 days. Now count: on how many did it actually rain? If it's around 140 (which is 70% of 200), they're calibrated at that level. If it rained on only 100 of them, they were overconfident — they said 70% but reality was 50%. If it rained on 180, they were underconfident. Calibration is simply: do your numbers mean what they say?

Our forecasts, grouped by stated probability

A well-calibrated forecaster's dots sit on the dashed line: when they say 40%, it happens 40% of the time. Falling below the line means overconfidence; above it means underconfidence.

Notice what calibration is not. It is not about being right on any single day, and it is not about being bold. A forecaster who simply says "50%" about everything can look vaguely reasonable but is useless — they're not telling you anything specific. Good forecasting needs two things together: probabilities that are sharp (close to 0 or 100 when the situation warrants) and calibrated (meaning what they say). polyAether cares about calibration above all, because a probability you can't trust is worse than no probability at all.

Key idea

Well-calibrated means your numbers are honest: when you say 70%, it happens about 70% of the time — checked across hundreds of cases, not one. Calibration is the only thing that makes a probability safe to bet on.

03 — Edge: our probability minus the market price

Now we connect the forecast to the money. Recall from Chapter 2 that in a prediction market, a contract that pays $1 if some event happens trades at a price that behaves like a probability. If "Rain in Chicago today?" is priced at 55 cents, the market is effectively saying "55% chance." Buy it for 55 cents; if it rains you collect $1, if it doesn't you get nothing.

So now we have two probabilities sitting side by side for the very same event:

The market price — the crowd's probability, in cents.
Our model probability — what our weather ensemble (the ~122-member GFS + ICON + ECMWF forecast from Chapter 3) says.

The gap between them is the whole game. We call it the edge:

Edge = our probability − the market price

Suppose the market is charging 55 cents for "Rain today," but our calibrated ensemble puts the true chance at 70%. We think a $1 payout is worth about 70 cents, and we can buy it for 55. That 15-cent gap is our edge — the amount we believe the market has mispriced the contract in our favor. If our probability is honest, buying at 55 is buying a 70-cent thing at a discount, over and over.

Edge can point the other way too. If the market charges 80 cents but we think the real chance is only 60%, there's no edge in buying — the contract is overpriced by our lights, and we pass (or, where possible, take the other side). And when our number and the market's number roughly agree, edge is near zero and there's simply nothing to do. No edge, no trade. Most days, most markets, that's the correct answer.

Key idea

Edge = our probability minus the market price. It's the discount (or premium) between what we think a contract is worth and what it costs. Positive edge is the only reason to place a bet; when the two numbers agree, we sit still.

Why edge only works if we're calibrated

Here is the thread that ties the chapter together. Edge is measured against our probability — so if our probability is dishonest, our edge is a fantasy. Imagine we're chronically overconfident: we say 70% when reality is only 55%. We'll "see" a fat 15-cent edge against a 55-cent market that is, in fact, perfectly priced. We'd bet confidently into markets where we have no advantage at all, and slowly bleed money while feeling clever.

That's why calibration isn't an academic nicety — it's the safety catch on the whole machine. An edge is only real if the probability behind it is trustworthy. This is exactly why polyAether validates its numbers against real outcomes before risking a cent, and why it's still strictly paper-trading with no proven track record. A believable edge starts with a believable probability.

In the next chapter we'll turn to the specific, repeatable reason these gaps exist at all: the crowd systematically overpays for surprises — pricing uncertain, dramatic outcomes at roughly 1.3× their true odds. That predictable mistake is where our edge comes from.