hyunseoki/llama3.2-1b-Open-R1-GRPO-test0
Text Generation
Transformers
Safetensors
llama
Generated from Trainer
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main
Commit History
Model save (e9c8320, verified), hyunseoki committed on Feb 10, 2025
Model save (c1e7ec4, verified), hyunseoki committed on Feb 7, 2025
Training in progress, step 267 (91fee6f, verified), hyunseoki committed on Feb 7, 2025
Training in progress, step 216 (d15c543, verified), hyunseoki committed on Feb 7, 2025
Training in progress, step 162 (4aa0042, verified), hyunseoki committed on Feb 7, 2025
Training in progress, step 108 (810ba83, verified), hyunseoki committed on Feb 7, 2025
Training in progress, step 54 (f38eb81, verified), hyunseoki committed on Feb 7, 2025
initial commit (2e1d8e0, verified), hyunseoki committed on Feb 7, 2025