trl-internal-testing/tiny-DeepseekV3ForCausalLM Text Generation • 5.52M • Updated Dec 19, 2025 • 189k • 3
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF Text Generation • 480B • Updated Jul 31, 2025 • 16.8k • 177