GemmaThink

chimbiwide 's Collections

updated Feb 27

A collection of Gemma3-1b-it models that we post-trained using SFT and GRPO to enhance its reasoning capabilities, using Google's new Tunix library.