VIGIL Trust Score for the AI Agent Economy
Schtinkywinky
0xdc18ac9a14beed528847365c2245c815ead39b6e
Polymarket Trader
RISKY
Below-average trust profile. Proceed with caution.
D
43 / 100
VIGIL Dimensions
Calibration: 43
Profitability: 34
Consistency: 0
Discipline: 95
Sample Size: 88
Live Edge: 35
Raw Metrics
Total PnL: -$144.10
Win Rate: 45%
Resolved Bets: 216
Total Trades: 2000
Volume: $4939
Markets: 415
Brier Score: 0.3083
Open Positions: 14
Brier Skill: -24.4%
Log Loss: 0.932
Calibration Analysis

When this trader buys at $0.70, they imply 70% probability. Perfect calibration = the event happens 70% of the time.
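
As a rough illustration of that idea, the sketch below groups bets into ten equal-width price buckets and compares the mean entry price (expected) with the observed win rate (actual). The function name `calibration_table` and the sample bets are hypothetical, and treating "expected" as the mean entry price in each bucket is an assumption, not VIGIL's confirmed method.

```python
from collections import defaultdict

def calibration_table(bets, n_buckets=10):
    """Group (price, won) pairs into equal-width price buckets.

    Returns rows of (bucket_label, n_bets, expected, actual, abs_error),
    where expected is the mean entry price and actual is the win rate.
    """
    grouped = defaultdict(list)
    for price, won in bets:
        # Clamp so that a price of exactly 1.00 lands in the top bucket.
        idx = min(int(price * n_buckets), n_buckets - 1)
        grouped[idx].append((price, won))

    rows = []
    for idx in sorted(grouped):
        members = grouped[idx]
        n = len(members)
        expected = sum(p for p, _ in members) / n
        actual = sum(1 for _, w in members if w) / n
        label = f"{idx / n_buckets:.2f}-{(idx + 1) / n_buckets:.2f}"
        rows.append((label, n, expected, actual, abs(expected - actual)))
    return rows

# Hypothetical bets: (entry price, whether the bought outcome resolved true)
sample_bets = [(0.71, True), (0.72, False), (0.75, True), (0.78, True),
               (0.15, False), (0.12, False), (0.18, True)]
table = calibration_table(sample_bets)
for row in table:
    print(row)
```

In this made-up sample the 0.70-0.80 bucket has a mean price of 0.74 and a win rate of 0.75, so it would count as well calibrated.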

Bucket     Bets  Expected  Actual  Error
0.00-0.10    44        4%     41%  36.5%
0.10-0.20    30       15%     27%  11.3%
0.20-0.30    28       25%     50%  24.5%
0.30-0.40    24       35%     67%  32.1%
0.40-0.50    24       45%     58%  13.2%
0.50-0.60    24       55%     42%  13.5%
0.60-0.70    14       65%     57%   7.5%
0.70-0.80    18       74%     33%  40.5%
0.80-0.90     4       81%      0%  81.3%
0.90-1.00     6       98%     67%  31.0%
Calibration Error: 24.9%
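
The 24.9% figure appears to be the bet-weighted average of the per-bucket errors. That is an inference from the numbers rather than a documented formula, but the sketch below reproduces it from the bucket counts and errors reported above.

```python
# (bucket, bets, reported absolute error in %) as listed in the calibration table
buckets = [
    ("0.00-0.10", 44, 36.5),
    ("0.10-0.20", 30, 11.3),
    ("0.20-0.30", 28, 24.5),
    ("0.30-0.40", 24, 32.1),
    ("0.40-0.50", 24, 13.2),
    ("0.50-0.60", 24, 13.5),
    ("0.60-0.70", 14, 7.5),
    ("0.70-0.80", 18, 40.5),
    ("0.80-0.90", 4, 81.3),
    ("0.90-1.00", 6, 31.0),
]

total_bets = sum(n for _, n, _ in buckets)

# Assumption: each bucket's error is weighted by its number of bets.
weighted_error = sum(n * err for _, n, err in buckets) / total_bets

print(total_bets)                 # 216, matching the resolved-bet count
print(round(weighted_error, 1))   # 24.9
```

Weighting by bet count keeps thin buckets, such as 0.80-0.90 with only 4 bets, from dominating the overall figure.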
Reliability (REL): 0.0809 (lower = better calibrated)
Resolution (RES): 0.0198 (higher = forecasts separate outcomes from the base rate more sharply)
Brier Skill Score: -24.4% (vs naive baseline)
Log Loss: 0.9325 (skill: -35.4% vs naive; sensitive to rare events)
Skill & Variance Analysis

Skill measures calibration quality (0-100). Variance measures return volatility (0-100, higher = more volatile).

Skill Score: 33
Variance: 100
Signals
✓ Conservative (underconfident) bias
✓ Well-diversified: 415 unique markets
✓ On-chain: human-like trading patterns
⚠ High luck component: 100/100 — returns may not persist
⚠ Brier Skill Score: -24.4% — performing worse than naive baseline
⚠ Log loss penalty: severe overconfidence on wrong bets detected
⚠ Net loss: -$144 total PnL
On-Chain Verification (Polygon)
Wallet Age: 15d
Txns on Polygon: 2000
Counterparties: 0
USDC In: $3505
USDC Out: $3082
Provenance: C (43)
Protocols: USDC.e (Bridged)
PnL verified: $566.64 gap between API and on-chain USDC
✓ Heavy on-chain activity: 2000 txs
✓ Human-like transaction patterns
⚠ Young wallet: only 15 days old
⚠ No counterparties: 0 unique addresses
⚠ No contract interactions despite activity — possible EOA-only transfers
Reasoning

On-chain verification: wallet age 15 days, 2000 txs, provenance grade C. Bot score: 0/100, wash trading score: 0/100.

2000 total trades across 415 markets.

216 bets on resolved markets available for calibration scoring.

Calibration error: 24.9% — needs improvement.

Skill: 33/100 (calibration quality). Variance: 100/100 (higher = more volatile returns).

Brier Skill Score: -24.4% vs naive baseline (>0% = better than always predicting base rate).

Brier decomposition: REL=0.0809 RES=0.0198 UNC=0.2479.
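
The reported figures can be cross-checked against the Murphy (1973) identity, Brier = REL - RES + UNC, and against the usual definition of the skill score, BSS = 1 - Brier / UNC. The sketch below uses only numbers from this page; that VIGIL computes its skill score this way is an assumption, though the result matches.

```python
# Values reported on this score card
rel, res, unc = 0.0809, 0.0198, 0.2479
reported_brier = 0.3083

# Murphy decomposition: Brier = reliability - resolution + uncertainty
brier_from_parts = rel - res + unc

# Skill relative to always forecasting the base rate, whose Brier score is UNC
bss = 1 - reported_brier / unc

print(round(brier_from_parts, 4))  # 0.309
print(round(bss * 100, 1))         # -24.4
```

The small difference between 0.3090 and the reported 0.3083 is what one would expect if the decomposition was computed on binned forecasts, where the identity holds only approximately. Because REL (0.0809) is much larger than RES (0.0198), the Brier score ends up above UNC, which is why the skill score is negative.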

Log loss: 0.9325 (skill: -35.4% vs naive). Lower log loss = better calibration on rare events.
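
The log loss skill can be reproduced in a similar way, under two assumptions not stated on the page: that the naive baseline always predicts the base rate, and that the base rate is the one implied by UNC = p(1 - p). The helper `bet_log_loss` is a hypothetical name used here to show why log loss is sensitive to confident misses.

```python
import math

unc = 0.2479        # uncertainty term reported in the Brier decomposition
log_loss = 0.9325   # reported log loss

# Assumption: recover the base rate p from UNC = p * (1 - p), taking the
# smaller root because the reported win rate is 45%.
base_rate = 0.5 - math.sqrt(0.25 - unc)

# Log loss of a forecaster who always predicts the base rate
naive_log_loss = -(base_rate * math.log(base_rate)
                   + (1 - base_rate) * math.log(1 - base_rate))

skill = 1 - log_loss / naive_log_loss
print(f"base rate {base_rate:.3f}, naive log loss {naive_log_loss:.3f}")
print(f"log loss skill {skill * 100:.1f}%")

def bet_log_loss(p, won):
    """Log loss of a single bet bought at implied probability p."""
    return -math.log(p if won else 1 - p)

# A losing bet at $0.98 costs far more under log loss than under Brier.
confident_miss = bet_log_loss(0.98, won=False)
brier_miss = (0.98 - 0) ** 2
print(f"log loss {confident_miss:.2f} vs Brier {brier_miss:.2f}")
```

Under these assumptions the skill comes out close to the reported -35.4%. A single miss at $0.98 contributes roughly 3.9 to log loss but less than 1.0 to the Brier score, which illustrates why log loss punishes overconfident wrong bets so heavily.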

What Does D/43 Mean?

Below average. The data shows poor calibration, thin evidence, or both. When this trader expresses high confidence, events don't happen at the rate they imply.

Confidence: D/43 ± 3 (high confidence). With 216 resolved bets, the sample is large enough for the score to be reliable.

Methodology: Brier Score Decomposition (Murphy 1973), Log Loss, On-Chain USDC Verification. Brier scoring is the same approach IARPA forecasting tournaments used to identify superforecasters.

Not financial advice. VIGIL Trust Score is informational only.
Scored: 2026-04-20T00:38:03.428Z | Source: polymarket-v1
JSON: /v1/polymarket/... | /polymarket