ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 Reinforcement Learning • 15B • Updated Feb 13, 2025 • 2.1k • 840
Running 601 Scaling test-time compute 📈 601 Boost LLM answers with flexible test‑time search strategies