Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
lordChipotle
/
Llama3GRPOReasoning
like
1
Reinforcement Learning
Safetensors
openai/gsm8k
llama
Model card
Files
Files and versions
xet
Community
main
Llama3GRPOReasoning
/
generation_config.json
Commit History
Upload LlamaForCausalLM
b3ebf5c
verified
lordChipotle
commited on
Jun 4, 2025