Research · Hugging Face Blog ·

MosaicLeaks: Can your research agent keep a secret?

MosaicLeaks: Can your research agent keep a secret?

Hugging Face Blog introduces MosaicLeaks, a benchmark that tests whether research agents can keep secrets while working on tasks. It frames the benchmark as a way to evaluate secret leakage behavior in agent systems.

Read the full story at Hugging Face Blog →