ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8 Reinforcement Learning • 8B • Updated Mar 28, 2025 • 17.5k • 200
ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4 Reinforcement Learning • 8B • Updated Mar 26, 2025 • 718 • 224
Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B Reinforcement Learning • 2B • Updated Apr 6, 2025 • 75 • 1
NousResearch/DeepHermes-ToolCalling-Specialist-Atropos Reinforcement Learning • 8B • Updated Apr 28, 2025 • 1 • 15
ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 Reinforcement Learning • 15B • Updated Feb 13, 2025 • 2.14k • 821