A forecast is not a guess and it is not a promise. It is something in between: a careful calculation that says here is what will probably happen, and here is how sure I am. Understanding that middle ground is the whole point of this chapter.
In Chapter 2 we met prediction markets — places where a price acts like a probability. In this book we care about markets that ask weather questions: Will tomorrow's high temperature in New York be above 90°F? To have any hope of trading such a market well, you first need to understand where a weather forecast comes from, and — just as importantly — why it can never be perfectly certain. That uncertainty is not a flaw. It turns out to be exactly where the opportunity lives.
The atmosphere is physics, not magic
Weather feels chaotic and moody, but the air above your head obeys a small set of physical laws. Warm air rises. Pressure differences push air from high to low, and we feel that push as wind. Water evaporates, cools, and condenses into clouds and rain. None of this is mysterious — it is the same physics that makes a kettle boil, just spread over a whole planet.
Because these laws are known, they can be written as equations — mathematical rules that say given the air's temperature, pressure, humidity, and motion right now, here is what they will be a few seconds from now.
A numerical weather model (a computer program that simulates the atmosphere using those physics equations) does exactly this, over and over.
Here is the mental picture. Imagine dividing the whole atmosphere into a giant 3-D grid of boxes, like a colossal stack of sugar cubes wrapped around the Earth. For each box the computer stores a few numbers: how warm the air is, how much moisture it holds, which way and how fast it is moving. Then it applies the physics equations to nudge every box forward one small time-step, using its neighbors' values. Repeat that step millions of times and you have marched the simulated weather hours or days into the future.
A forecast is a physics simulation. Start from the atmosphere's current state, apply known physical laws step by step, and watch where the numbers go. It is closer to predicting the path of a thrown ball than to reading tea leaves.
Two unavoidable sources of doubt
If the physics is known, why isn't the forecast perfect? Two reasons, and both are baked in permanently — no amount of better computers fully removes them.
1. We never know the starting point exactly
The simulation needs to begin from the atmosphere's current state. But we measure that state from a scattered patchwork — weather stations, balloons, aircraft, satellites, buoys. There is no thermometer in every one of those millions of grid boxes. So the starting numbers are always a little bit wrong: a fraction of a degree off here, a slightly mis-measured wind there.
2. Small errors grow
You might hope a tiny starting error stays tiny. In weather, it does the opposite. The atmosphere is a chaotic system — meaning small differences in the starting conditions can grow into large differences later. This is the famous butterfly effect
: not that a butterfly literally causes a storm, but that an error too small to measure can, days later, be the difference between rain and sun.
A good analogy is a marble balanced at the top of a smooth hill. Give it the faintest nudge left and it rolls down the left side; nudge it right and it goes right. The nudges are almost identical; the outcomes are opposite. The atmosphere is full of moments like that marble.
Forecast uncertainty is not laziness or bad equipment. It comes from two permanent facts: we can only measure today's weather approximately, and tiny approximations grow over time. A forecast that claims total certainty is lying.
What an ensemble is
Here is the beautiful trick that makes uncertainty measurable instead of just admitted.
If we cannot know the exact starting point, we do not run the simulation once and pretend we do. Instead we run it many times. Each run starts from a slightly different, equally plausible version of right now
— we deliberately jiggle the starting numbers within the range of our measurement uncertainty, and we also nudge the model's own internal settings a little. This whole collection of runs is called an ensemble (a group of forecasts run together, each from a slightly tweaked starting point).
Now watch what the ensemble tells you — for free. Suppose you are forecasting tomorrow's high in a city and you run 100 tweaked simulations. If 90 of them land above 90°F and 10 land below, you have a natural, honest number: roughly a 90% chance of clearing 90°F. The fraction of runs that produce an outcome is the probability of that outcome. Counting how many copies of the future agree turns physics into odds.
The spread — how much the runs disagree — is itself the message:
- If nearly all the runs cluster tightly around the same answer, the forecast is confident. Small nudges didn't change the story, so we're on the flat part of the hill, far from any balanced marble.
- If the runs scatter all over the place, the forecast is uncertain. Tiny differences blew up into wildly different weather — we're near a marble on a hilltop, and honesty demands we say so.
An ensemble runs the same forecast many times with tiny tweaks to the starting point. The fraction of runs that agree on an outcome is its probability, and how tightly the runs cluster tells you how confident to be. This is how a physics simulation produces an honest number like 70% chance of rain.
More opinions, better odds
No single model is perfect, so the strongest approach is to blend several. Different national weather centers have built different models, each with its own strengths. polyAether pulls together an ensemble of about 122 members drawn from three of the world's leading systems — the American GFS, the German ICON, and the European ECMWF (each is just the name of a major weather model). Combining many members from several independent systems gives a broader, more trustworthy spread than any one model alone.
Those forecasts are checked against reality at roughly 80 curated weather stations — specific, reliable measurement sites — because in the end a bet on the high in New York
is settled by what one real thermometer records. (How a market decides who won turns out to be surprisingly fiddly; that's the subject of Chapter 7.)
An ensemble hands us a probability the market has to compete with. If our 122 runs say 72% chance,
and the market price implies 60%,
we may have found a disagreement worth acting on. But — as the next chapters show — a raw model number is not yet a trustworthy one. It has to be calibrated first.
From weather to edge
You now have the core machinery. A forecast is a physics simulation; it is uncertain because we can't measure the start perfectly and errors grow; and an ensemble converts that uncertainty into an actual probability by running the simulation many times and counting the outcomes.
The next question is sharper: when the model says 70%,
does it really happen 70% of the time? Turning a model's number into a probability you can bet real money on is the work of Chapter 4, on probability, calibration, and edge — where we'll see why markets systematically overpay for uncertainty (by a factor of roughly 1.3), and how a well-behaved forecast can quietly exploit that.