Gemma 3 (1B) — GRPO Reasoning Model
Author: R3po
Institution: EAFIT University
Course: Artificial Intelligence — Workshop #3
Base model: unsloth/gemma-3-1b-it
License: Apache-2.0
This gemma3_text model was trained 2x faster with Unsloth and Huggingface's TRL library.
