view post Post 2920 Good news, llama.cpp seems to be close to supporting MTP on qwen models. Bad news, every single gguf will have to be redone when it is. See translation 1 reply · 👀 15 15 + Reply
view post Post 3812 Google releases Gemma 4. ✨Gemma 4 introduces 4 models: E2B, E4B, 26B-A4B, 31B.The multimodal reasoning models are under Apache 2.0.Run E2B and E4B on ~6GB RAM, and on phones. Run 26B-A4B and 31B on ~18GB.GGUFs: https://huggingface.co/collections/unsloth/gemma-4Guide: https://unsloth.ai/docs/models/gemma-4 See translation 🔥 22 22 🚀 8 8 👍 1 1 ❤️ 1 1 + Reply
Running Agents Featured 208 Voxtral TTS Demo ⚡ 208 Generate realistic speech from text with custom or preset voices
mistralai/Voxtral-Mini-4B-Realtime-2602 Automatic Speech Recognition • 4B • Updated Mar 11 • 1.29M • 847