S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence Paper • 2606.20515 • Published 10 days ago • 39
PP-OCRv6 Collection From 1.5M to 34.5M Parameters, Surpassing Billion-Scale VLMs on OCR Tasks • 19 items • Updated 12 days ago • 97
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published May 14 • 91
Mistral Small 4 Collection A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated Mar 16 • 75
view article Article Exploring Environments Hub: Your Language Model needs better (open) environments to learn anakin87 • Sep 4, 2025 • 31
view article Article HUMAINE: A Rigorous Framework for Understanding AI Through Human Experience ProlificAI • Sep 16, 2025 • 7
view article Article We Got Claude to Fine-Tune an Open Source LLM burtenshaw, evalstate • Dec 4, 2025 • 630
Transformers.js demos Collection A collection of my favorite WebML demos, built with Transformers.js! • 30 items • Updated 10 days ago • 146
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning Paper • 2511.22570 • Published Nov 27, 2025 • 96
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published Nov 9, 2025 • 141
The Bestiary Collection Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 5 items • Updated May 21 • 115