GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models • Paper • arXiv:2508.06471 • Published Aug 8, 2025
Establishing Best Practices for Building Rigorous Agentic Benchmarks • Paper • arXiv:2507.02825 • Published Jul 3, 2025