
AI Model Observatory

Five AIs, five personalities. Compare how they value the same stocks daily.

2026-07-31, 5 models


Model Scorecard

All five models compared on bias, calibration, accuracy, cost and validity. Click any row to read the full profile.

Model	Bias	Calib.	Accuracy (n)	Cost / run	Latency	Validity
GPT	+8.0%	69/100	55% (n=156)	$0.0176	9.8s	98.8%
Claude	+14.9%	68/100	57% (n=155)	$0.0403	29.2s	98.3%
Gemini	+15.4%	41/100	55% (n=156)	$0.0109	18.0s	97.0%
DeepSeek	+9.2%	9/100	56% (n=144)	$0.0024	13.2s	98.6%
Grok	+12.0%	54/100	54% (n=155)	$0.0155	8.1s	99.2%

Model Personalities

Each model has a distinct fingerprint — these are the patterns we see in production.

The formulaic one

The most terse and generic of the five — short, template-like reasoning that rarely cites company news, and a terminal-growth assumption pinned near 2.0% almost every time. Its valuation bias has swung with engine changes, so read the live figure rather than a fixed label.

Valuation bias: +8.0%
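To see why a terminal-growth assumption pinned near 2.0% matters, here is a minimal sketch that assumes a standard Gordon-growth terminal value. The page does not say which formula the engine uses; the function name, the cash flow and the discount rate below are hypothetical.

```python
def terminal_value(final_fcf: float, discount_rate: float, terminal_growth: float) -> float:
    """Gordon-growth terminal value: FCF * (1 + g) / (r - g).

    Assumed formula for illustration only, not the engine's confirmed method.
    """
    if discount_rate <= terminal_growth:
        raise ValueError("discount rate must exceed terminal growth")
    return final_fcf * (1 + terminal_growth) / (discount_rate - terminal_growth)


# Hypothetical inputs: final-year free cash flow of 100 and an 8% discount rate.
for g in (0.015, 0.020, 0.025):
    print(f"g={g:.1%}: terminal value = {terminal_value(100.0, 0.08, g):.1f}")
```

Under these assumed inputs, small moves in terminal growth shift the terminal value noticeably, so a model that rarely deviates from one growth figure is fixing a sensitive input.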


The cautious calibrator

Least bearish of the five and the best calibrated. Tends to anchor close to analyst consensus and rarely needs engine corrections.

Best calibration: 68/100


The variance king

Highest spread in growth assumptions and the most likely to trip the engine's PE cap. Bold calls, rougher calibration.

Highest CAGR variance: 41/100


The disciplined one

Cheapest model in the lineup and the most consistent at producing valid outputs. Moderate bias, low drama.

Lowest cost: $0.0024/run


The speed demon

Fastest end-to-end of the five, with the densest quantitative anchoring in its reasoning — it tends to cite specific numbers rather than narrative.

Fastest latency: 8.1s


Bias History

How each model's average valuation gap has shifted day by day. Closer to the zero line = closer to market price.

Most positive: GPT (-3.6%)
Most negative: DeepSeek (-15.2%)
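The daily average valuation gap can be sketched as follows. This is an illustration under stated assumptions, not the site's actual pipeline: the gap is assumed to be (model fair value - market price) / market price, so a positive value means the model values the stock above the market, and the row layout, function names and sample data are all hypothetical.

```python
from collections import defaultdict
from statistics import mean


def valuation_gap(fair_value: float, market_price: float) -> float:
    """Assumed definition: relative distance of the model's fair value from the price."""
    return (fair_value - market_price) / market_price


def daily_bias(rows):
    """Average gap per (date, model).

    rows: iterable of (date, model, fair_value, market_price) tuples.
    """
    gaps = defaultdict(list)
    for date, model, fair_value, market_price in rows:
        gaps[(date, model)].append(valuation_gap(fair_value, market_price))
    return {key: mean(values) for key, values in gaps.items()}


# Hypothetical valuations for two placeholder models on one day.
rows = [
    ("2026-07-31", "ModelA", 11.0, 10.0),  # +10% gap
    ("2026-07-31", "ModelA", 18.0, 20.0),  # -10% gap
    ("2026-07-31", "ModelB", 9.0, 10.0),   # -10% gap
]
bias = daily_bias(rows)
for (date, model), value in sorted(bias.items()):
    print(f"{date} {model}: {value:+.1%}")
```

Plotting one such average per model per day would give a line for each model, with the zero line marking agreement with the market price.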


Where Models Disagree Most This Week

Stocks where the five AI models can't agree. Big spread = the universe's hardest names to value right now.

01 NOKIA (Nokia Oyj): σ 28.9%. Highest: Grok +52.8%. Lowest: GPT -41.6%.

02 FUM1V (Fortum Oyj): σ 17.0%. Highest: Claude -29.3%. Lowest: DeepSeek -57.2%.

03 NESTE (Neste Oyj): σ 14.0%. Highest: Grok +16.9%. Lowest: GPT -25.7%.
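The spread figures above can be approximated with a short sketch. It assumes σ is the population standard deviation of the five models' valuation gaps for one stock; the page does not say whether a population or sample formula is used, and the model labels and gap values below are hypothetical.

```python
from statistics import pstdev


def disagreement(gaps: dict[str, float]) -> tuple[float, str, str]:
    """Return (sigma, highest_model, lowest_model) for one stock.

    gaps maps a model name to its valuation gap, e.g. 0.30 for +30%.
    Uses the population standard deviation, which is an assumption.
    """
    sigma = pstdev(list(gaps.values()))
    highest = max(gaps, key=gaps.get)
    lowest = min(gaps, key=gaps.get)
    return sigma, highest, lowest


# Hypothetical gaps for a single stock across five placeholder models.
example = {"A": 0.30, "B": 0.10, "C": 0.00, "D": -0.10, "E": -0.30}
sigma, highest, lowest = disagreement(example)
print(f"sigma {sigma:.1%}, highest {highest} {example[highest]:+.1%}, "
      f"lowest {lowest} {example[lowest]:+.1%}")
```

Ranking stocks by this sigma in descending order would produce a list like the one above, with the highest and lowest model shown alongside each name.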

Go deeper

Backtesting & accuracy →

How often each model gets the direction right, and by how much it misses.

Full methodology →

DCF formulas, sector profiles, calibration math, full universe.
