ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8 Reinforcement Learning • 8B • Updated Mar 28, 2025 • 1.49k • 190
ielabgroup/Autobool-Qwen4b-Reasoning-objective Reinforcement Learning • 4B • Updated 6 days ago • 17 • 1
ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 Reinforcement Learning • 15B • Updated Feb 13, 2025 • 2.31k • 820