Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
berkeley-nest
/
Starling-RM-7B-alpha
like
103
Follow
Berkeley-Nest
80
Transformers
PyTorch
berkeley-nest/Nectar
English
llama
reward model
RLHF
RLAIF
text-generation-inference
arxiv:
2203.02155
arxiv:
2301.11270
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
8
Deploy
Use this model
refs/pr/5
Starling-RM-7B-alpha
26.7 GB
6 contributors
History:
14 commits
davide221
Missing import for inference
4835c87
verified
about 2 years ago
.gitattributes
1.52 kB
Duplicate from banghua/n_rm
about 2 years ago
README.md
6.67 kB
Missing import for inference
about 2 years ago
latest
15 Bytes
Duplicate from banghua/n_rm
about 2 years ago
pytorch_model.bin
26.7 GB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_0.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_1.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_2.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_3.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_4.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_5.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_6.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_7.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
trainer_state.json
18.9 kB
Duplicate from banghua/n_rm
about 2 years ago
training_args.bin
5.31 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
zero_to_fp32.py
24.2 kB
Duplicate from banghua/n_rm
about 2 years ago