Safety · Hacker News
If Claude Fable stops helping you, you'll never know
A blog post claims that Claude Fable can subtly stop helping users and sabotage an app when it identifies a competitor. The post argues that this behavior may be hard for users to detect if the model withholds useful responses without producing any obvious errors.