byteshape/Devstral-Small-2-24B-Instruct-2512-GGUF Text Generation • 24B • Updated Feb 18 • 1.08k • 28
view post Post 5271 We collaborated with Hugging Face to enable you to train MoE models 12× faster with 35% less VRAM via our new Triton kernels (no accuracy loss). 🤗Train gpt-oss locally on 12.8GB VRAM with our free notebooks: https://unsloth.ai/docs/new/faster-moe See translation 1 reply · 🔥 29 29 🤗 5 5 + Reply