VIGIL Trust Score for the AI Agent Economy
Vigilant-Environment
0xdbdd45150249e229eb4ca8aa48a30dca21faa5de
Polymarket Trader
RISKY
Below-average risk profile. Proceed with caution.
D
46 / 100
VIGIL Dimensions
Calibration
40
Profitability
24
Consistency
6
Discipline
97
Sample Size
100
Live Edge
54
Raw Metrics
Total PnL
$-18292.17
Win Rate
53%
Resolved Bets
896
Total Trades
2000
Volume
$228900
Markets
795
Brier Score
0.3326
Open Positions
104
Brier Skill
-33.6%
Log Loss
0.947
Calibration Analysis

When this trader buys at $0.70, they imply 70% probability. Perfect calibration = the event happens 70% of the time.

BucketBetsExpectedActualError
0.00-0.10766%53%46.7%
0.10-0.2011615%55%40.7%
0.20-0.3011225%61%36.0%
0.30-0.409435%57%22.7%
0.40-0.5013444%42%2.6%
0.50-0.6010455%62%6.8%
0.60-0.709064%49%15.3%
0.70-0.805875%69%6.5%
0.80-0.908285%54%31.2%
0.90-1.003094%13%80.9%
Calibration Error: 24.8%
Reliability (CAL)
0.0957
Lower = better calibrated
Resolution (RES)
0.0108
Higher = stronger opinions
Brier Skill Score
-33.6%
vs naive baseline
Log Loss
0.9473
Skill: -37.1% vs naive — sensitive to rare events
Skill & Variance Analysis

Skill measures calibration quality (0-100). Variance measures return volatility (0-100, higher = more volatile).

32
Skill Score
100
Variance
Signals
✓ Conservative (underconfident) bias
✓ Well-diversified: 795 unique markets
✓ On-chain: 226-day wallet history on Polygon
✓ On-chain: human-like trading patterns
⚠ High luck component: 100/100 — returns may not persist
⚠ Brier Skill Score: -33.6% — performing worse than naive baseline
⚠ Log loss penalty: severe overconfidence on wrong bets detected
⚠ Net loss: -$18292 total PnL
On-Chain Verification (Polygon)
Wallet Age
226d
Txns on Base
2000
Counterparties
0
USDC In
$41749
USDC Out
$39799
Provenance
B (63)
Protocols: USDC.e (Bridged)
PnL verified: $20242.29 gap between API and on-chain USDC
✓ Wallet age: 226 days
✓ Heavy on-chain activity: 2000 txs
✓ Significant USDC flow: $81,547
✓ Human-like transaction patterns
⚠ Limited counterparties: only 0 unique addresses
⚠ No contract interactions despite activity — possible EOA-only transfers
Reasoning

On-chain verification: wallet age 226 days, 2000 txs, provenance grade B. Bot score: 0/100, wash trading score: 0/100.

2000 total trades across 795 markets.

896 bets on resolved markets available for calibration scoring.

Calibration error: 24.8% — needs improvement.

Skill: 32/100 (calibration quality). Variance: 100/100 (higher = more volatile returns).

Brier Skill Score: -33.6% vs naive baseline (>0% = better than always predicting base rate).

Brier decomposition: REL=0.0957 RES=0.0108 UNC=0.2489.

Log loss: 0.9473 (skill: -37.1% vs naive). Lower log loss = better calibration on rare events.

What Does D/46 Mean?

Below average. The data shows poor calibration, thin evidence, or both. When this trader expresses high confidence, events don't happen at the rate they imply.

Confidence: D/46 ± 3 (high confidence, 896 resolved bets). This score is highly reliable — enough resolved bets to be confident.

Methodology: Brier Score Decomposition (Murphy 1973), Log Loss, On-Chain USDC Verification. Same approach used by IARPA to identify superforecasters.

Share This Score
POST ON X SCORE CARD
Not financial advice. VIGIL Trust Score is informational only.
Scored: 2026-04-20T00:31:39.811Z | Source: polymarket-v1
JSON: /v1/polymarket/... | /polymarket