AI Model Observatory
Five AIs, five personalities. Compare how they value the same stocks daily.
2026-06-17 · 5 MODELS
Model Scorecard
All five models compared on bias, calibration, accuracy, cost, latency and validity.
| Model | Bias | Calib. | Accuracy | Cost / run | Latency | Validity |
|---|---|---|---|---|---|---|
| GPT | -0.5% | 69/100 | 49% (n=1715) | $0.0168 | 10.5s | 98.1% |
| Claude | +0.0% | 67/100 | 49% (n=1711) | $0.0391 | 28.6s | 97.9% |
| Gemini | -0.7% | 71/100 | 47% (n=1689) | $0.0105 | 18.2s | 96.7% |
| DeepSeek | -5.4% | 67/100 | 48% (n=1748) | $0.0022 | 13.8s | 100.0% |
| Grok | -3.5% | 63/100 | 48% (n=1737) | $0.0153 | 8.0s | 99.4% |
Model Personalities
Each model has a distinct fingerprint — these are the patterns we see in production.
GPT
The formulaic one
The most terse and generic of the five — short, template-like reasoning that rarely cites company news, and a terminal-growth assumption pinned near 2.0% almost every time. Its valuation bias has swung with engine changes, so read the live figure rather than a fixed label.
Valuation bias: -0.5%
Claude
The cautious calibrator
Least bearish of the five and the best calibrated. Tends to anchor close to analyst consensus and rarely needs engine corrections.
Best calibration: 67/100
Gemini
The variance king
Highest spread in growth assumptions and the most likely to trip the engine's PE cap. Bold calls, rougher calibration.
Highest CAGR variance: 71/100
DeepSeek
The disciplined one
Cheapest model in the lineup and the most consistent at producing valid outputs. Moderate bias, low drama.
Lowest cost: $0.0022/run
Grok
The speed demon
Fastest end-to-end of the five, with the densest quantitative anchoring in its reasoning — it tends to cite specific numbers rather than narrative.
Fastest latency: 8.0s
Bias History
How each model's average valuation gap has shifted day by day. Closer to the zero line = closer to market price.
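For readers who want to reproduce a number like this, here is a minimal sketch of one way an "average valuation gap" could be computed. It assumes the gap is the model's fair value minus the market price, expressed as a percentage of the market price, and that the daily bias is a plain unweighted mean across stocks. The function names, the sign convention and the sample figures are illustrative assumptions, not the Observatory's actual pipeline.

```python
from statistics import mean


def valuation_gap(fair_value: float, market_price: float) -> float:
    """Signed gap between a model's fair value and the market price, in percent.

    Assumed convention: negative means the model values the stock below
    the market (bearish), positive means above (bullish).
    """
    if market_price <= 0:
        raise ValueError("market_price must be positive")
    return (fair_value - market_price) / market_price * 100.0


def daily_bias(estimates: dict[str, tuple[float, float]]) -> float:
    """Unweighted mean gap for one model on one day.

    `estimates` maps ticker -> (fair_value, market_price).
    """
    return mean(valuation_gap(fv, px) for fv, px in estimates.values())


# Hypothetical tickers and prices, for illustration only.
sample = {
    "AAA": (90.0, 100.0),   # model is 10% below market
    "BBB": (210.0, 200.0),  # model is 5% above market
}
bias = daily_bias(sample)
print(f"{bias:+.1f}%")
```

Under this convention a value near zero means the model's fair values sit close to market prices on average, which matches how the zero line is described above. A cap-weighted or median-based average would be an equally plausible choice.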
Where Models Disagree Most This Week
Stocks where the five AI models can't agree. Big spread = the universe's hardest names to value right now.
02 · META · Meta Platforms Inc. · σ 14.3%
Highest: Gemini +61.5% · Lowest: DeepSeek +12.1%
03 · AMZN · Amazon.com Inc. · σ 13.3%
Highest: Gemini +1.1% · Lowest: DeepSeek -26.3%
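A spread figure like σ can be sketched as the standard deviation of the five models' valuation gaps for a single stock. The snippet below assumes a population standard deviation over the per-model gaps in percent. The page does not say whether a sample or population formula is used, and the model names and gap values here are hypothetical placeholders rather than real readings.

```python
from statistics import pstdev


def disagreement(gaps: dict[str, float]) -> dict:
    """Summarise how far apart the models are on one stock.

    `gaps` maps model name -> valuation gap in percent.
    Population standard deviation is an assumption of this sketch.
    """
    if len(gaps) < 2:
        raise ValueError("need at least two models to measure disagreement")
    highest = max(gaps, key=gaps.get)
    lowest = min(gaps, key=gaps.get)
    return {
        "sigma": pstdev(list(gaps.values())),
        "highest": (highest, gaps[highest]),
        "lowest": (lowest, gaps[lowest]),
    }


def rank_by_disagreement(universe: dict[str, dict[str, float]]) -> list[str]:
    """Order tickers from most to least contested."""
    return sorted(
        universe,
        key=lambda ticker: disagreement(universe[ticker])["sigma"],
        reverse=True,
    )


# Hypothetical gaps for two made-up tickers.
universe = {
    "AAA": {"ModelA": 10.0, "ModelB": 20.0, "ModelC": 30.0, "ModelD": 40.0, "ModelE": 50.0},
    "BBB": {"ModelA": 4.0, "ModelB": 5.0, "ModelC": 6.0, "ModelD": 5.0, "ModelE": 5.0},
}
summary = disagreement(universe["AAA"])
ranking = rank_by_disagreement(universe)
```

Sorting by σ rather than by the highest-minus-lowest range keeps a single outlier model from dominating the ranking, though either measure would fit the description above.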