Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
berkeley-nest
/
Starling-RM-7B-alpha
like
103
Follow
Berkeley-Nest
80
Transformers
PyTorch
berkeley-nest/Nectar
English
llama
reward model
RLHF
RLAIF
text-generation-inference
arxiv:
2203.02155
arxiv:
2301.11270
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
8
Deploy
Use this model
refs/pr/8
Starling-RM-7B-alpha
53.4 GB
6 contributors
History:
17 commits
SFconvertbot
Adding `safetensors` variant of this model
1f913fb
verified
10 months ago
.gitattributes
1.52 kB
Duplicate from banghua/n_rm
about 2 years ago
README.md
6.73 kB
Update README.md
almost 2 years ago
config.json
621 Bytes
Create config.json
over 1 year ago
latest
15 Bytes
Duplicate from banghua/n_rm
about 2 years ago
model.safetensors
26.7 GB
xet
Adding `safetensors` variant of this model
10 months ago
pytorch_model.bin
26.7 GB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_0.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_1.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_2.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_3.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_4.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_5.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_6.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
rng_state_7.pth
21.7 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
trainer_state.json
18.9 kB
Duplicate from banghua/n_rm
about 2 years ago
training_args.bin
5.31 kB
xet
Duplicate from banghua/n_rm
about 2 years ago
zero_to_fp32.py
24.2 kB
Duplicate from banghua/n_rm
about 2 years ago