Running 27 Weight-Space Geometry of Offline Reasoning Training 🧠27 Interactive weight-space geometry of six reasoning losses
Running on Zero Agents 9 Hermes · SIQ-1-35B (ZeroGPU) 🪽 9 Generate live web pages from your text prompt
view article Article Borealis — open data, code, weights recipe for training Audio LLM AlexWortega • May 25 • 15
view article Article CRAFT: Continuous Reasoning and Agentic Feedback Tuning flymy-ai • Feb 5 • 66
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation Paper • 2601.22813 • Published Jan 30 • 63
view article Article Introducing smolagents: simple agents that write actions in code. +1 m-ric, merve, thomwolf • Dec 31, 2024 • 1.2k
Quartet: Native FP4 Training Can Be Optimal for Large Language Models Paper • 2505.14669 • Published May 20, 2025 • 79
NousResearch/Nous-Hermes-2-SOLAR-10.7B Text Generation • 11B • Updated Feb 20, 2024 • 9.53k • • 208