Research · Hacker News ·
GLM 5.2 beats Claude in our benchmarks
Semgrep reports benchmark results claiming GLM 5.2 outperformed Claude on its cybersecurity evaluation set. The post links to a blog entry describing the tests and their comparison methodology.