VIGIL: 4fee-s1-test scored D/39

4fee-s1-test

0x25f4707c93e4bfdf26cd6c5cc46c5464691cf88e

Polymarket Trader

RISKY

Below-average risk profile. Proceed with caution.

D

39 / 100

VIGIL Dimensions

Calibration

1

Profitability

0

Consistency

98

Discipline

94

Sample Size

100

Live Edge

50

Raw Metrics

Total PnL

$-3270.76

Win Rate

0%

Resolved Bets

986

Total Trades

3100

Volume

$8611

Markets

814

Brier Score

0.2561

Open Positions

14

Brier Skill

-999.0%

Log Loss

0.710

Calibration Analysis

When this trader buys at $0.70, they imply 70% probability. Perfect calibration = the event happens 70% of the time.

Bucket	Bets	Expected	Actual	Error
0.10-0.20	10	16%	0%	15.8%
0.20-0.30	50	27%	0%	27.0%
0.30-0.40	200	35%	1%	34.1%
0.40-0.50	238	45%	0%	45.3%
0.50-0.60	286	53%	0%	53.3%
0.60-0.70	152	65%	0%	64.6%
0.70-0.80	36	73%	0%	72.5%
0.80-0.90	10	83%	0%	83.3%
0.90-1.00	4	91%	0%	91.4%

Calibration Error: 48.7%

Reliability (CAL)

0.2535

Lower = better calibrated

Resolution (RES)

0.0000

Higher = stronger opinions

Brier Skill Score

-999.0%

vs naive baseline

Log Loss

0.7102

Skill: -4763.1% vs naive — sensitive to rare events

Skill & Variance Analysis

Skill measures calibration quality (0-100). Variance measures return volatility (0-100, higher = more volatile).

2

Skill Score

14

Variance

Signals

✓ Well-diversified: 814 unique markets

✓ On-chain: diverse counterparty network

⚠ Overconfidence bias: 0.487

⚠ Brier Skill Score: -12552.8% — performing worse than naive baseline

⚠ Low resolution: forecasts cluster near base rate — no genuine opinions

⚠ Log loss penalty: severe overconfidence on wrong bets detected

⚠ Receive-only wallet: zero outbound transactions — possible proxy/settlement address

⚠ Net loss: -$3271 total PnL

⚠ Low win rate: 0%

⚠ PnL divergence: $3731 gap between API and on-chain USDC flows

⚠ On-chain: bot-like trading patterns detected (bot score: 80)

On-Chain Verification (Polygon)

Wallet Age

21d

Txns on Base

2962

Counterparties

30

USDC In

$5196

USDC Out

$4735

Provenance

B (61)

Protocols: USDC, USDC.e (Bridged)

PnL divergence: $3731.22 gap between API and on-chain USDC

✓ Heavy on-chain activity: 2962 txs

✓ Diverse trading network: 30 counterparties

⚠ Young wallet: only 21 days old

⚠ Receive-only wallet: never sent a transaction

⚠ Bot-like behavior detected: 436 burst txs, median interval 6s

Reasoning

On-chain verification: wallet age 21 days, 2962 txs, provenance grade B. Bot score: 80/100, wash trading score: 0/100.

Polymarket on-chain coverage: $2,521 in / $0 out across 110 withdrawal tx since 2026-04-01.

3100 total trades across 814 markets.

986 bets on resolved markets available for calibration scoring.

Calibration error: 48.7% — needs improvement.

Skill: 2/100 (calibration quality). Variance: 14/100 (higher = more volatile returns).

Brier Skill Score: -12552.8% vs naive baseline (>0% = better than always predicting base rate).

Brier decomposition: REL=0.2535 RES=0.0000 UNC=0.0020.

Log loss: 0.7102 (skill: -4763.1% vs naive). Lower log loss = better calibration on rare events.

What Does D/39 Mean?

Below average. The data shows poor calibration, thin evidence, or both. When this trader expresses high confidence, events don't happen at the rate they imply.

Confidence: D/39 ± 3 (high confidence, 986 resolved bets). This score is highly reliable — enough resolved bets to be confident.

Methodology: Brier Score Decomposition (Murphy 1973), Log Loss, On-Chain USDC Verification. Same approach used by IARPA to identify superforecasters.

Share This Score

POST ON X SCORE CARD