Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated Apr 6 • 241k • 2.84k
Qwopus3.5-v3.5/v3 Collection 🌟Qwopus3.5-v3.5 is the latest model in the Claude series. • 14 items • Updated 7 days ago • 103
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF Image-Text-to-Text • 27B • Updated Apr 6 • 137k • 604
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 89 items • Updated 16 days ago • 603
view post Post 3934 We collaborated with NVIDIA to teach you about Reinforcement Learning and RL environments. 💚 Learn:• Why RL environments matter + how to build them• When RL is better than SFT• GRPO and RL best practices• How verifiable rewards and RLVR workBlog: https://unsloth.ai/blog/rl-environments See translation 4 replies · 🔥 9 9 🤝 2 2 ❤️ 1 1 + Reply