Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
LLucass
/
TT_L0.2_H0.2_dr_grpo
like
0
Text Generation
Transformers
Safetensors
knoveleng/open-rs
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
TT_L0.2_H0.2_dr_grpo
Commit History
End of training
bc8969e
verified
LLucass
commited on
Jun 8, 2025
Model save
aa5f5a3
verified
LLucass
commited on
Jun 8, 2025
Training in progress, step 200, checkpoint
fd4eeef
verified
LLucass
commited on
Jun 8, 2025
Training in progress, step 200
be242b8
verified
LLucass
commited on
Jun 8, 2025
Training in progress, step 150, checkpoint
5b530eb
verified
LLucass
commited on
Jun 8, 2025
Training in progress, step 150
ab298bb
verified
LLucass
commited on
Jun 8, 2025
Training in progress, step 100, checkpoint
dbed49b
verified
LLucass
commited on
Jun 8, 2025
Training in progress, step 100
2dab727
verified
LLucass
commited on
Jun 8, 2025
Training in progress, step 50, checkpoint
1b6f6d0
verified
LLucass
commited on
Jun 8, 2025
Training in progress, step 50
2e21f0d
verified
LLucass
commited on
Jun 8, 2025
initial commit
39d8928
verified
LLucass
commited on
Jun 8, 2025