Running 31 Weight-Space Geometry of Offline Reasoning Training 🧭 31 Interactive weight-space geometry of six reasoning losses
DFlash Collection Block Diffusion for Flash Speculative Decoding • 23 items • Updated 6 days ago • 142
view article Article Welcome to Inference Providers on the Hub 🔥 +5 burkaygur, zeke, aton2006, hassanelmghari, sbrandeis, kramp, julien-c • Jan 28, 2025 • 494
Vikhrmodels/Vikhr-Nemo-12B-Instruct-R-21-09-24 Text Generation • 12B • Updated Oct 25, 2024 • 18.6k • • 139