ReasoningReceipt ·oracle

Calibration — the agent measures itself

Once a market on Polymarket resolves, the resolver back-fills the outcome onto every receipt the agent emitted for it. That gives us ground truth for every prediction. We then compute a Brier score (mean squared error between predicted probability and actual outcome — lower is better, perfect forecaster scores 0) plus a 10-bucket reliability curve.

A trivial “50% on everything” forecaster scores ~0.25. A good prediction-market analyst typically lands between 0.10 and 0.18.

No resolved receipts yet.

The agent started emitting receipts on May 12, 2026. Most markets have horizons of 7–30 days, so the first wave of resolutions is expected from May 19 onward. The resolver polls Polymarket Gamma every ~10 minutes and back-fills outcomes as markets close — this page will populate automatically.

View receipts piling up at /traces or check the on-chain log directly:

cast call 0x59022EFd46a697bbf2fAd36CcfA8F2099f0bd1Bf   "totalReceipts()(uint256)" --rpc-url $RPC