lordChipotle/Llama3GRPOReasoning
Reinforcement Learning
Safetensors
openai/gsm8k
llama
Llama3GRPOReasoning/tokenizer.json (branch: main)
Commit History
Upload tokenizer
95f662e
verified
lordChipotle committed on Jun 4, 2025