berkeley-nest/Starling-RM-7B-alpha
Transformers
PyTorch
berkeley-nest/Nectar
English
llama
reward model
RLHF
RLAIF
text-generation-inference
arxiv: 2203.02155
arxiv: 2301.11270
License: apache-2.0
refs/pr/3
Starling-RM-7B-alpha
26.7 GB
6 contributors
History: 14 commits
HenriqueMendes · Luciano · fb18d4f · about 2 years ago
.gitattributes · 1.52 kB · Duplicate from banghua/n_rm · about 2 years ago
Henrique - Sem título 15 de nov. de 2023 2232 2023-11-15 22_34.m4a · 170 kB · Luciano · about 2 years ago
README.md · 6.63 kB · Fix issues in sample code: Invalid reward_tokenizer and issue in returning scores (#1) · about 2 years ago
latest · 15 Bytes · Duplicate from banghua/n_rm · about 2 years ago
pytorch_model.bin · 26.7 GB (xet) · Duplicate from banghua/n_rm · about 2 years ago
rng_state_0.pth · 21.7 kB (xet) · Duplicate from banghua/n_rm · about 2 years ago
rng_state_1.pth · 21.7 kB (xet) · Duplicate from banghua/n_rm · about 2 years ago
rng_state_2.pth · 21.7 kB (xet) · Duplicate from banghua/n_rm · about 2 years ago
rng_state_3.pth · 21.7 kB (xet) · Duplicate from banghua/n_rm · about 2 years ago
rng_state_4.pth · 21.7 kB (xet) · Duplicate from banghua/n_rm · about 2 years ago
rng_state_5.pth · 21.7 kB (xet) · Duplicate from banghua/n_rm · about 2 years ago
rng_state_6.pth · 21.7 kB (xet) · Duplicate from banghua/n_rm · about 2 years ago
rng_state_7.pth · 21.7 kB (xet) · Duplicate from banghua/n_rm · about 2 years ago
trainer_state.json · 18.9 kB · Duplicate from banghua/n_rm · about 2 years ago
training_args.bin · 5.31 kB (xet) · Duplicate from banghua/n_rm · about 2 years ago
zero_to_fp32.py · 24.2 kB · Duplicate from banghua/n_rm · about 2 years ago