VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models Paper • 2606.16140 • Published 10 days ago • 112
Running on Zero Agents Featured 65 Gemma Diffusion Website Builder 🌐 65 Watch a diffusion LLM write a website live, then tweak it
Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs Paper • 2605.09063 • Published May 9 • 82