Research · The Decoder · 3 July 2026

GPT and Claude failed Bridgewater's finance tests because the right answers were never public

Bridgewater and Thinking Machines Lab say a finely tuned open-weight model outperformed leading AI models on financial document evaluations at much lower cost. The results were based on the organizations' own analysis, and the article notes that the correct answers were not publicly available.

Read the full story at The Decoder →