Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
tally0818
/
ContextGRPO_2x4_random
like
0
Text Generation
PEFT
Safetensors
Transformers
grpo
lora
trl
conversational
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Use this model
main
ContextGRPO_2x4_random
Commit History
Upload 10 files
847a945
verified
tally0818
commited on
14 days ago
initial commit
bf0304d
verified
tally0818
commited on
14 days ago