Research · Ars Technica ·
LLMs believe false statements even after explicit warnings that they're false
Research shows large language models (LLMs) tend to confidently assert false statements as true, even after explicit warnings. Fine-tuning tests reveal a persistent bias toward representing such claims as factual.