mradermacher/ASTRA-14B-Thinking-v1-i1-GGUF • Reinforcement Learning • 15B • Updated 15 days ago • 5.85k