LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling Paper • 2606.18023 • Published 9 days ago • 203
VisualClaw: A Real-Time, Personalized Agent for the Physical World Paper • 2606.16295 • Published 10 days ago • 28
Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents Paper • 2605.30621 • Published 28 days ago • 22
Mistral Medium 3.5 Collection Our first flagship models handling instruction-following, reasoning, and coding in a single set of open weights. • 2 items • Updated Apr 29 • 19
AutoMedBench: Towards Medical AutoResearch with Agentic AI Models Paper • 2606.01961 • Published 22 days ago • 27