Xuerui2312/DeepSeek-R1-Distill-Qwen-7B-TRPA-DeepScaleR-verl0326 Text Generation • 8B • Updated Jun 20, 2025 • 3 • 1
ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8 Reinforcement Learning • 8B • Updated Mar 28, 2025 • 23.8k • 200
ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4 Reinforcement Learning • 8B • Updated Mar 26, 2025 • 794 • 224