Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
bsvaz
/
teaching-llm-to-reason
like
0
Text Generation
PEFT
Safetensors
Transformers
grpo
lora
trl
unsloth
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Use this model
main
teaching-llm-to-reason
479 MB
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
bsvaz
Upload model from training
b4eff70
verified
24 days ago
.gitattributes
Safe
1.52 kB
initial commit
24 days ago
README.md
Safe
5.27 kB
Upload model from training
24 days ago
adapter_config.json
1.09 kB
Upload model from training
24 days ago
adapter_model.safetensors
479 MB
xet
Upload model from training
24 days ago