Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
hchang
/
reward_modeling
like
0
PEFT
TensorBoard
Safetensors
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Use this model
main
reward_modeling
/
checkpoint-2500
1.86 MB
1 contributor
History:
1 commit
hchang
Upload folder using huggingface_hub
e937342
verified
over 1 year ago
README.md
5.08 kB
Upload folder using huggingface_hub
over 1 year ago
adapter_config.json
642 Bytes
Upload folder using huggingface_hub
over 1 year ago
adapter_model.safetensors
594 kB
xet
Upload folder using huggingface_hub
over 1 year ago
optimizer.pt
1.2 MB
xet
Upload folder using huggingface_hub
over 1 year ago
rng_state.pth
14.2 kB
xet
Upload folder using huggingface_hub
over 1 year ago
scheduler.pt
1.06 kB
xet
Upload folder using huggingface_hub
over 1 year ago
trainer_state.json
42.5 kB
Upload folder using huggingface_hub
over 1 year ago
training_args.bin
5.18 kB
xet
Upload folder using huggingface_hub
over 1 year ago