AI Model Observatory
Five AIs, five personalities. Compare how they value the same stocks daily.
2026-06-17 · 5 MODELS
Model Scorecard
All five models compared on bias, calibration, accuracy, cost, latency and validity.
| Model | Bias | Calibration | Accuracy | Cost / run | Latency | Validity |
|---|---|---|---|---|---|---|
| GPT | +0.7% | 57/100 | 48% (n=830) | $0.0170 | 10.9s | 99.3% |
| Claude | -1.1% | 56/100 | 48% (n=821) | $0.0397 | 29.1s | 98.2% |
| Gemini | -2.0% | 61/100 | 46% (n=804) | $0.0107 | 18.5s | 96.2% |
| DeepSeek | -4.6% | 58/100 | 47% (n=836) | $0.0022 | 14.1s | 100.0% |
| Grok | -2.1% | 38/100 | 48% (n=831) | $0.0155 | 8.2s | 99.4% |
Model Personalities
Each model has a distinct fingerprint — these are the patterns we see in production.
GPT
The formulaic one
The most terse and generic of the five — short, template-like reasoning that rarely cites company news, and a terminal-growth assumption pinned near 2.0% almost every time. Its valuation bias has swung with engine changes, so read the live figure rather than a fixed label.
Valuation bias: +0.7%
Claude
The cautious calibrator
Least bearish of the five and the best calibrated. Tends to anchor close to analyst consensus and rarely needs engine corrections.
Best calibration: 56/100
Gemini
The variance king
Highest spread in growth assumptions and the most likely to trip the engine's PE cap. Bold calls, rougher calibration.
Highest CAGR variance: 61/100
DeepSeek
The disciplined one
Cheapest model in the lineup and the most consistent at producing valid outputs. Moderate bias, low drama.
Lowest cost: $0.0022/run
Grok
The speed demon
Fastest end-to-end of the five, with the densest quantitative anchoring in its reasoning — it tends to cite specific numbers rather than narrative.
Fastest latency: 8.2s
Bias History
How each model's average valuation gap has shifted day by day. Closer to the zero line = closer to market price.
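For readers who want to reproduce a bias figure like this, here is a minimal sketch. It assumes the valuation gap is the signed difference between a model's fair value and the market price, expressed as a fraction of the market price, and that a day's bias is the plain average of those gaps. The page does not state the exact formula, so the function names `valuation_gap` and `daily_bias` and the definition itself are illustrative assumptions.

```python
from statistics import mean


def valuation_gap(fair_value: float, market_price: float) -> float:
    """Signed gap as a fraction of market price (assumed definition).

    Positive means the model values the stock above the market price,
    negative means below.
    """
    if market_price <= 0:
        raise ValueError("market_price must be positive")
    return (fair_value - market_price) / market_price


def daily_bias(valuations: list[tuple[float, float]]) -> float:
    """Average gap for one model on one day.

    `valuations` holds (fair_value, market_price) pairs, one per stock.
    """
    return mean(valuation_gap(fv, mp) for fv, mp in valuations)
```

Under this assumption, a result near zero means the model's valuations sit close to market prices on average, although large positive and negative gaps on individual stocks can still cancel out.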
Where Models Disagree Most This Week
Stocks where the five AI models can't agree. Big spread = the universe's hardest names to value right now.
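One way to rank names by disagreement is sketched below. It assumes the spread is the range of the models' fair values divided by their median, which is only one plausible definition, since the page does not say how the spread is computed. The function names and the tickers in the usage note are hypothetical.

```python
from statistics import median


def model_spread(fair_values: dict[str, float]) -> float:
    """Range of model fair values relative to their median (assumed definition)."""
    values = list(fair_values.values())
    mid = median(values)
    if mid <= 0:
        raise ValueError("median fair value must be positive")
    return (max(values) - min(values)) / mid


def rank_by_disagreement(
    universe: dict[str, dict[str, float]],
) -> list[tuple[str, float]]:
    """Return (ticker, spread) pairs, widest spread first."""
    spreads = [(ticker, model_spread(fv)) for ticker, fv in universe.items()]
    return sorted(spreads, key=lambda item: item[1], reverse=True)
```

For example, with a hypothetical universe of `{"AAA": {"GPT": 100, "Claude": 110, "Gemini": 90}, "BBB": {"GPT": 50, "Claude": 51, "Gemini": 49}}`, the ticker `AAA` ranks first because its models disagree by a larger share of the median value.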