chimbiwide/gemma-3-1b-it-thinking-32k-grpo-merged-Q8_0-GGUF
1.0B • Updated • 41
A collection of Gemma3-1b-it models that we post-trained with SFT and GRPO, using Google's new Tunix library, to enhance their reasoning capabilities.
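
For readers unfamiliar with GRPO, the core idea is that several completions are sampled for the same prompt and each one is scored relative to the others in its group. The snippet below is a minimal sketch of that group-relative normalization only. It is not taken from Tunix or from our training code; the function name `group_relative_advantages`, the `eps` parameter, and the example rewards are all hypothetical.

```python
import math


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize rewards within one group of completions for the same prompt.

    Hypothetical helper for illustration; not part of Tunix.
    """
    mean = sum(rewards) / len(rewards)
    # Population variance over the group.
    var = sum((r - mean) ** 2 for r in rewards) / len(rewards)
    std = math.sqrt(var)
    # eps avoids division by zero when every completion gets the same reward.
    return [(r - mean) / (std + eps) for r in rewards]


# Illustrative rewards for four sampled completions (1.0 = correct answer).
advantages = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
```

Completions that score above the group mean receive a positive advantage and those below receive a negative one, so no separate value model is needed to provide a baseline.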