When this trader buys at $0.70, they imply 70% probability. Perfect calibration = the event happens 70% of the time.
| Bucket | Bets | Expected | Actual | Error |
|---|---|---|---|---|
| 0.10-0.20 | 2 | 12% | 0% | 12.0% |
| 0.20-0.30 | 2 | 26% | 0% | 26.0% |
| 0.30-0.40 | 4 | 36% | 0% | 36.0% |
| 0.40-0.50 | 6 | 44% | 0% | 44.0% |
| 0.50-0.60 | 8 | 54% | 0% | 53.7% |
| 0.60-0.70 | 10 | 65% | 0% | 65.2% |
| 0.70-0.80 | 6 | 78% | 0% | 78.3% |
| 0.80-0.90 | 8 | 84% | 0% | 84.2% |
| 0.90-1.00 | 2 | 91% | 0% | 91.0% |
On-chain verification: wallet age 46 days, 907 txs, provenance grade C. Bot score: 0/100, wash trading score: 0/100.
607 total trades across 297 markets.
48 bets on resolved markets available for calibration scoring.
Calibration error: 60.3% — needs improvement.
Skill: 0/100 (calibration quality). Variance: 12/100 (higher = more volatile returns).
Brier Skill Score: -62.0% vs naive baseline (>0% = better than always predicting base rate).
Brier decomposition: REL=0.4046 RES=0.0000 UNC=0.0000.
Log loss: 1.0794 (skill: -13550.5% vs naive). Lower log loss = better calibration on rare events.
Below average. The data shows poor calibration, thin evidence, or both. When this trader expresses high confidence, events don't happen at the rate they imply.
Confidence: D/42 ± 15 (low confidence, 48 resolved bets). Low confidence — take this score with a grain of salt until more markets resolve.
Methodology: Brier Score Decomposition (Murphy 1973), Log Loss, On-Chain USDC Verification. Same approach used by IARPA to identify superforecasters.