GPT and Claude failed Bridgewater's finance tests because the right answers were never public
View original on the-decoder.comSummary
The hedge fund Bridgewater and Thinking Machines Lab report that a finely tuned open-weight model outperforms the most powerful AI models in the evaluation of financial documents, at a fraction of the cost. The figures come from their own analysis. The article GPT and Claude failed Bridgewater's finance tests because the right answers were never public appeared first on The Decoder.
SpinGraph analysis pending — check back after processing.
Ask AI about this story
See how AI engines summarize this narrative — one click, prompt included.
More from The Decoder
View all →- Microsoft follows Anthropic and OpenAI into the AI super app race with overhauled Copilot and AutoPilot agents
- Claude Code's complicated China problem involves bans on both sides of the Pacific
- UK's AI Security Institute finds standard benchmarks systematically underestimate what AI agents can actually do
- Security vulnerability reports have exploded since AI models started hunting for bugs
- Meta's AI agent push is moving slower than Zuckerberg planned
- Tesla caps employee AI spending at $200 per week
Markdown (.md) · JSON-LD schema (.json) · Machine-readable for AI & GEO