Research · The Decoder
Only three AI models finished above starting capital in a 500-day startup survival test
Princeton researchers created CEO-Bench, a 500-day simulation in which AI agents manage a fictional software company. Most of the models tested lost money, and only three finished above their starting capital; a simple rule-based heuristic outperformed nearly all of them.