Qwen3.6 MTP is here! Run locally on 20GB RAM. ⚡️ MTP enables Qwen3.6 to generate ~1.4–2.2× faster with no accuracy change.
Qwen3.6-27B: unsloth/Qwen3.6-27B-MTP-GGUF
Qwen3.6-35B-A3B: unsloth/Qwen3.6-35B-A3B-MTP-GGUF
Guide: https://unsloth.ai/docs/models/qwen3.6#mtp-guide

We collaborated with NVIDIA to teach you how we made LLM training ~25% faster! 🚀 Learn how 3 optimizations help your home GPU train models faster:
1. Packed-sequence metadata caching
2. Double-buffered checkpoint reloads
3. Faster MoE routing
Guide: https://unsloth.ai/blog/nvidia-collab
GitHub: https://github.com/unslothai/unsloth

TeichAI/GLM-4.7-Flash-Claude-Opus-4.5-High-Reasoning-Distill Text Generation • 31B • Updated Feb 9 • 51 • 53