Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
nnheui
/
pythia-1.4b-dpo-full
like
0
Text Generation
Transformers
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
gpt_neox
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
License:
apache-2.0
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Deploy
Use this model
main
pythia-1.4b-dpo-full
2.83 GB
1 contributor
History:
71 commits
nnheui
End of training
1a33399
verified
over 1 year ago
runs
End of training
over 1 year ago
.gitattributes
1.52 kB
initial commit
almost 2 years ago
README.md
17.8 kB
End of training
over 1 year ago
all_results.json
928 Bytes
End of training
over 1 year ago
config.json
753 Bytes
End of training
over 1 year ago
eval_results.json
729 Bytes
End of training
over 1 year ago
generation_config.json
133 Bytes
Model save
over 1 year ago
model.safetensors
2.83 GB
xet
Model save
over 1 year ago
special_tokens_map.json
587 Bytes
Training in progress, step 100
almost 2 years ago
tokenizer.json
2.11 MB
Training in progress, step 100
over 1 year ago
tokenizer_config.json
5.27 kB
Training in progress, step 100
over 1 year ago
train_results.json
232 Bytes
Model save
over 1 year ago
trainer_state.json
378 kB
Model save
over 1 year ago
training_args.bin
6.26 kB
xet
Training in progress, step 600
over 1 year ago