Weight-Space Geometry of Offline Reasoning Training: Interactive weight-space geometry of six reasoning losses
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation (Paper 2601.22813, published Jan 30)