Yuki20/gemma3-GRPO-Reasoning
Transformers
Safetensors
English
text-generation-inference
unsloth
gemma3_text
trl
License: apache-2.0
main
Commit History
Upload model trained with Unsloth
757481b
verified
Yuki20 committed on Mar 28, 2025
Upload model trained with Unsloth
1c6a8f0
verified
Yuki20 committed on Mar 28, 2025
Upload README.md with huggingface_hub
4ba7f52
verified
Yuki20 committed on Mar 28, 2025
initial commit
be6397a
verified
Yuki20 committed on Mar 28, 2025