Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
AMindToThink
/
PYTHIA-FT-ORPO-ISAERFT
like
0
Transformers
Generated from Trainer
smol-course
module_1
isaerft
arxiv:
2403.07691
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
PYTHIA-FT-ORPO-ISAERFT
/
trainable_param.json
Commit History
End of training
42aec0a
verified
AMindToThink
commited on
Feb 28, 2025