mradermacher/MARSHAL-Generalist-Qwen3-8B-GGUF Reinforcement Learning • 8B • Updated Dec 2, 2025 • 669