Hugging Face
Pedro13543/test_grpo
Tags: Text Generation, Transformers, PyTorch, openai/gsm8k, llama, unsloth, trl, grpo, conversational, text-generation-inference, arxiv:1910.09700
License: mit
Branch: main
File: test_grpo/pytorch_model.bin
Commit History
Trained with Unsloth (b300421, verified): Pedro13543 committed on Mar 4, 2025
Trained with Unsloth (db63093, verified): Pedro13543 committed on Mar 4, 2025
Trained with Unsloth (03a40be, verified): Pedro13543 committed on Mar 4, 2025
Trained with Unsloth (8c5bbe4, verified): Pedro13543 committed on Mar 3, 2025