Research · Hugging Face Blog
MosaicLeaks: Can your research agent keep a secret?
The Hugging Face Blog introduces MosaicLeaks, a benchmark that tests whether research agents can keep secrets while carrying out their tasks. The post frames the benchmark as a way to evaluate secret-leakage behavior in agent systems.