mradermacher/Autobool-Qwen4b-No-reasoning-GGUF • Reinforcement Learning • 4B • Updated 23 days ago • 461
mradermacher/Autobool-Qwen4b-Reasoning-objective-GGUF • Reinforcement Learning • 4B • Updated 23 days ago • 499