nileshmalpeddi/ppo
Text Generation
Transformers
TensorBoard
Safetensors
trl-internal-testing/descriptiveness-sentiment-trl-style
exaone
Generated from Trainer
conversational
custom_code
arxiv:1909.08593
Files and versions
Branch: main
Repository size: 725 MB
1 contributor
History: 133 commits
Latest commit: "Training in progress, step 74000" by nileshmalpeddi (e2ff73d, verified), 10 months ago
Name                     Size           Last commit                       Updated
runs                     -              Training in progress, step 74000  10 months ago
.gitattributes           1.52 kB        initial commit                    11 months ago
README.md                2.06 kB        End of training                   10 months ago
config.json              1.25 kB        Training in progress, step 500    10 months ago
configuration_exaone.py  9.95 kB        Model save                        10 months ago
generation_config.json   111 Bytes      Model save                        10 months ago
merges.txt               1.22 MB        Training in progress, step 5      10 months ago
model.safetensors        635 MB (xet)   Training in progress, step 74000  10 months ago
special_tokens_map.json  563 Bytes      Training in progress, step 5      10 months ago
tokenizer.json           7.91 MB        Training in progress, step 500    10 months ago
tokenizer_config.json    70.8 kB        Training in progress, step 5      10 months ago
trainer_state.json       1.5 kB         End of training                   10 months ago
training_args.bin        7.48 kB (xet)  Training in progress, step 500    10 months ago
vocab.json               1.93 MB        Training in progress, step 5      10 months ago