When this trader buys at $0.70, they imply 70% probability. Perfect calibration = the event happens 70% of the time.
| Bucket | Bets | Expected | Actual | Error |
|---|---|---|---|---|
| 0.00-0.10 | 138 | 5% | 1% | 3.2% |
| 0.10-0.20 | 122 | 15% | 7% | 8.4% |
| 0.20-0.30 | 98 | 25% | 4% | 20.8% |
| 0.30-0.40 | 88 | 35% | 14% | 21.6% |
| 0.40-0.50 | 116 | 45% | 0% | 45.1% |
| 0.50-0.60 | 112 | 54% | 5% | 49.0% |
| 0.60-0.70 | 122 | 65% | 5% | 59.7% |
| 0.70-0.80 | 80 | 76% | 3% | 73.0% |
| 0.80-0.90 | 66 | 85% | 3% | 81.6% |
| 0.90-1.00 | 32 | 95% | 0% | 95.4% |
Skill measures calibration quality (0-100). Variance measures return volatility (0-100, higher = more volatile).
On-chain verification: wallet age 286 days, 2000 txs, provenance grade B. Bot score: 0/100, wash trading score: 0/100.
Polymarket on-chain coverage: $12,387 in / $0 out across 94 withdrawal tx since 2025-07-09.
3100 total trades across 861 markets.
974 bets on resolved markets available for calibration scoring.
Calibration error: 38.7% — needs improvement.
Skill: 14/100 (calibration quality). Variance: 45/100 (higher = more volatile returns).
Brier Skill Score: -542.8% vs naive baseline (>0% = better than always predicting base rate).
Brier decomposition: REL=0.2250 RES=0.0013 UNC=0.0413.
Log loss: 0.7625 (skill: -329.0% vs naive). Lower log loss = better calibration on rare events.
Below average. The data shows poor calibration, thin evidence, or both. When this trader expresses high confidence, events don't happen at the rate they imply.
Confidence: D/44 ± 3 (high confidence, 974 resolved bets). This score is highly reliable — enough resolved bets to be confident.
Methodology: Brier Score Decomposition (Murphy 1973), Log Loss, On-Chain USDC Verification. Same approach used by IARPA to identify superforecasters.