Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
EpistemeAI
/
Reasoning-Llama-3.2-3B-Math-Instruct-RE1-ORPO-align
like
0
Follow
EpisteLabs
58
Text Generation
Transformers
PyTorch
English
llama
text-generation-inference
unsloth
trl
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Reasoning-Llama-3.2-3B-Math-Instruct-RE1-ORPO-align
Commit History
Update README.md
e46f8a6
verified
legolasyiu
commited on
Feb 4, 2025
(Trained with Unsloth)
d08b634
verified
legolasyiu
commited on
Feb 4, 2025
Upload README.md with huggingface_hub
f92f286
verified
legolasyiu
commited on
Feb 4, 2025
initial commit
ec9920b
verified
legolasyiu
commited on
Feb 4, 2025