Running 27 Weight-Space Geometry of Offline Reasoning Training 🧭 27 Interactive weight-space geometry of six reasoning losses
ISTA-DASLab/Meta-Llama-3.1-8B-Instruct-AQLM-PV-2Bit-1x16-hf Text Generation • 2B • Updated Aug 28, 2024 • 89 • 8