Research · The Decoder ·

Only three AI models finished above starting capital in a 500-day startup survival test

Only three AI models finished above starting capital in a 500-day startup survival test

Princeton researchers created CEO-Bench, a 500-day simulation in which AI agents manage a fictional software company. Most tested models lost money, while a simple rule-based heuristic outperformed nearly all of them; only three models ended above starting capital.

Read the full story at The Decoder →