novateur/bbb
Tags: TensorBoard, Safetensors, arxiv:2408.05517, arxiv:2309.00986
Branch: main
Path: bbb/examples/train/rlhf
6.5 kB, 1 contributor, History: 1 commit
Latest commit: novateur, "Add files using upload-large-folder tool" (c4532b4, verified), 9 months ago
Name        Size        Last commit                                 Updated
dpo         -           Add files using upload-large-folder tool    9 months ago
README.md   94 Bytes    Add files using upload-large-folder tool    9 months ago
cpo.sh      785 Bytes   Add files using upload-large-folder tool    9 months ago
kto.sh      787 Bytes   Add files using upload-large-folder tool    9 months ago
orpo.sh     786 Bytes   Add files using upload-large-folder tool    9 months ago
ppo.sh      1.13 kB     Add files using upload-large-folder tool    9 months ago
rm.sh       784 Bytes   Add files using upload-large-folder tool    9 months ago
simpo.sh    720 Bytes   Add files using upload-large-folder tool    9 months ago