Cross-venue calibration: Manifold vs Kalshi on Fed events

Manifold (play-money) is widely cited in the academic literature as well-calibrated; Kalshi (real-money, CFTC-regulated) shows favorite-longshot bias in some categories. What does the same FOMC rate decision look like priced on both venues? Here's the methodology for the comparison and the back-of-envelope arb the spread implies.

Why this comparison matters

Calibration is the question every prediction market eventually faces: "when the market says 70%, does it happen 70% of the time?"

For most categories the empirical answer hovers near "yes, but with bias". The classic finding (well-replicated since the 1970s on parimutuel betting) is the favorite-longshot bias: longshots (markets pricing <10%) systematically over-resolve YES; favorites (markets pricing >90%) under-resolve YES. The gap is real money on real-money venues; on play-money venues it's smaller, sometimes absent.

A 2026 Fed Reserve paper (covered in Axios, Feb 2026) gave Kalshi a "perfect forecast record" on Fed rate decisions the day before each FOMC meeting — beating fed funds futures. That's a strong claim. The natural follow-up: does Manifold beat Kalshi on the same events?

The dataset surface

Both venues run identifiable Fed event markets — typically titled "Will the FOMC raise/hold/cut rates by X bps at the Y meeting?". Manifold has the wider category coverage; Kalshi has tighter contract specifications and more volume. The cross-venue match is at the level of same underlying event, slightly different question wording.

Our canonical schema makes this tractable: both venues map into the same markets + outcomes tables with a category, resolution_value, and a resolved_at timestamp. The matching problem becomes a fuzzy text-similarity join on the question text plus a closes_at alignment within ±48 hours.

WITH kalshi_fed AS (
  SELECT market_id, title, closes_at, resolved_at,
         resolution_value ->> 'YES' AS yes_payout
  FROM markets
  WHERE venue_id = 'kalshi'
    AND title ILIKE '%FOMC%'
    AND resolved_at IS NOT NULL
),
manifold_fed AS (
  SELECT market_id, title, closes_at, resolved_at,
         resolution_value ->> 'YES' AS yes_payout
  FROM markets
  WHERE venue_id = 'manifold'
    AND (title ILIKE '%FOMC%' OR title ILIKE '%fed%rate%')
    AND resolved_at IS NOT NULL
)
SELECT k.title AS kalshi_title, m.title AS manifold_title,
       k.closes_at, k.yes_payout, m.yes_payout AS manifold_yes_payout
FROM kalshi_fed k JOIN manifold_fed m
  ON ABS(EXTRACT(epoch FROM k.closes_at - m.closes_at)) < 172800;

In Phase-0 we have a 50-market sample per venue, which is too small for proper Fed coverage. The example query is the shape, not a quantitative claim. Phase 1 expands the Manifold + Kalshi backfills to cover 2023–2025 macro events fully, which is roughly ~80 paired Fed-decision contracts to cross-check.

The expected pattern

Three calibration plots become available once the matched pairs exist:

1. Implied probability vs realized. Bin Kalshi's 24-hour-pre-close implied probability into deciles. For each decile, plot the empirical YES rate. A perfectly calibrated venue produces the diagonal. Favorite-longshot bias bows away from the diagonal: high-probability deciles fall below, low-probability deciles rise above.

2. Cross-venue spread. For each matched pair, compute p_kalshi − p_manifold at T-24h. Distributional summary by category (Fed, election, sports). If Manifold is consistently below Kalshi on FOMC contracts, that's a structural belief difference.

3. Basis decay. Track the spread's evolution from T-7d to T-1h to T+0. Does it converge as resolution nears, or diverge? A converging spread with Manifold consistently below suggests Manifold is the "smart" venue (real-money traders should arbitrage in the Manifold direction). Diverging suggests one venue's pricing is capturing information the other isn't.

The arb (back-of-envelope)

Real-money arb across Kalshi and Manifold is structurally hard:

Manifold is play-money. There's no real-money payout, so positions don't hedge Kalshi exposure in dollar terms.
Kalshi requires KYC + a US bank account. Cross-venue automation requires both accounts open and funded.
Position size on Manifold is small (~$1,000 mana = ~$0 USD).

But the signal arb is real and works for institutional users:

Detect when Manifold significantly disagrees with Kalshi on a Fed event (e.g., Manifold 35% YES, Kalshi 60% YES).
If Manifold's track record is calibrated and Kalshi shows favorite-longshot drift, Kalshi's 60% is an over-confident favorite. Take NO on Kalshi at $0.40, expected edge if Manifold is right ~25%.
Position-size by the calibration confidence interval, not the raw spread.

For a pair-trading desk, this is a residual risk strategy on top of fed-funds futures. For a calibration researcher, it's the empirical question of which venue is "right" on what.

What our dataset enables

Two entries into this analysis:

Backtest mode. Pull all matched (Kalshi, Manifold) pairs with resolved_at IS NOT NULL. Compute calibration plots per venue and the cross-venue residual. Reproducible because both venues' resolutions are stored in outcomes.final_payout.

Live signal mode. Pull active matched pairs (both resolved_at IS NULL). Subscribe to price updates on both venues. Alert when |p_kalshi − p_manifold| > threshold with the threshold calibrated against the historical spread distribution.

The Phase-1 expansion adds 2023–2025 macro coverage and a published cross-venue reconciliation matrix — i.e., for every Kalshi market, the closest Manifold equivalent and the historical spread distribution. That artifact is the closest thing to a "calibrated reference" we know how to build.

Caveats

Title matching is hard. Manifold users phrase questions creatively; Kalshi titles are standardized. Phase 1 includes an embedding-based matcher (we use voyage-finance-2 in a parallel project; same approach here) with manual verification on the top 100 paired markets.
Resolution definitions diverge. Manifold "Will Powell announce a 25 bps cut at the May meeting?" might resolve YES on a 25-bp cut OR a 50-bp cut depending on creator intent. Kalshi's contract spec is precise. We use Kalshi's spec as the canonical interpretation when reconciling.
Liquidity asymmetry. Kalshi has 100–1000× the dollar volume on macro events; Manifold has more user diversity but tighter spreads on Manifold- specific markets. Calibration robustness suffers when either side is thinly traded.

Takeaways

Cross-venue calibration is one of the most useful research outputs the dataset can produce. It tells you which venue to trust on what.
The matching is fuzzy but tractable with a normalized schema + embedding lookup.
For institutional traders, the spread is a signal, not an arb — Manifold's play-money limits direct hedging, but disagreement encodes information.
Phase 1 ships the matched-pair artifact. Email us if you're working on a calibration paper and want a preview of the matching code.

Email for a preview of the matched-pair dataset, or read the schema to see how the venue fields line up.