Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
aracape
/
teaching-assistant-1B-dpo
like
0
Text Generation
Transformers
Safetensors
llama
Generated from Trainer
trl
dpo
conversational
text-generation-inference
arxiv:
2305.18290
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
teaching-assistant-1B-dpo
Commit History
Training in progress, step 50
5118f7e
verified
aracape
commited on
Jan 6
Training in progress, step 40
756372a
verified
aracape
commited on
Jan 6
Training in progress, step 20
c93a28f
verified
aracape
commited on
Jan 5
Upload config
a33c0de
verified
aracape
commited on
Dec 11, 2025
Training in progress, step 50
354ecb0
verified
aracape
commited on
Dec 11, 2025
initial commit
4b198e0
verified
aracape
commited on
Dec 11, 2025