Bucket: hf-doc-build/doc-dev (HuggingFace Doc Builds)
Path: trl/pr_5546/en
Total size: 185 GB
Files: 2,945,341
Updated: about 2 hours ago
Name | Size | Uploaded | Xet hash
_app | 236 items | about 1 month ago |
_toctree.yml | 3.35 kB | about 1 month ago | ac839051
async_grpo_trainer.html | 108 kB | about 1 month ago | da2b7161
async_grpo_trainer.md | 12.3 kB | about 1 month ago | fa4b8068
bco_trainer.html | 130 kB | about 1 month ago | 313ca5bb
bco_trainer.md | 14.1 kB | about 1 month ago | 0ffa53d1
bema_for_reference_model.html | 90 kB | about 1 month ago | 196adf99
bema_for_reference_model.md | 7.18 kB | about 1 month ago | 3f348286
callbacks.html | 95.8 kB | about 1 month ago | ddb77300
callbacks.md | 7.88 kB | about 1 month ago | 099ed4f4
chat_template_utils.html | 45.8 kB | about 1 month ago | 365f613c
chat_template_utils.md | 5.48 kB | about 1 month ago | ab3151bf
chat_templates.html | 28.3 kB | about 1 month ago | 3690266a
chat_templates.md | 5.13 kB | about 1 month ago | 81b75764
clis.html | 53.4 kB | about 1 month ago | 11a3e004
clis.md | 12.8 kB | about 1 month ago | db07946e
community_tutorials.html | 31 kB | about 1 month ago | 36a50a32
community_tutorials.md | 11 kB | about 1 month ago | 1ede2f05
cpo_trainer.html | 144 kB | about 1 month ago | e71cce24
cpo_trainer.md | 23 kB | about 1 month ago | ff00b008
customization.html | 31 kB | about 1 month ago | a2d020af
customization.md | 4.08 kB | about 1 month ago | e35b8043
data_utils.html | 62.9 kB | about 1 month ago | 660018d9
data_utils.md | 6.32 kB | about 1 month ago | 47f29775
dataset_formats.html | 223 kB | about 1 month ago | dd6b5148
dataset_formats.md | 40.2 kB | about 1 month ago | c95c0582
deepspeed_integration.html | 15.8 kB | about 1 month ago | fe162ffc
deepspeed_integration.md | 1.48 kB | about 1 month ago | 96460e48
distillation_trainer.html | 160 kB | about 1 month ago | 8e089a09
distillation_trainer.md | 12.3 kB | about 1 month ago | 2921b01f
distributing_training.html | 96.5 kB | about 1 month ago | fb34d163
distributing_training.md | 21 kB | about 1 month ago | 400cb97c
dpo_trainer.html | 232 kB | about 1 month ago | 895f4a3b
dpo_trainer.md | 30.8 kB | about 1 month ago | 95b8297d
example_overview.html | 42.8 kB | about 1 month ago | fc0a7525
example_overview.md | 19.7 kB | about 1 month ago | b18e5b1c
experimental_overview.html | 10.8 kB | about 1 month ago | bd5f9e37
experimental_overview.md | 1.6 kB | about 1 month ago | d2f8cf27
favicon.png | 1.57 kB | about 1 month ago | 6e06dd7b
gfpo.html | 84.4 kB | about 1 month ago | 80aa326c
gfpo.md | 5.24 kB | about 1 month ago | 8f836397
gkd_trainer.html | 119 kB | about 1 month ago | 5a1bb8c8
gkd_trainer.md | 14.1 kB | about 1 month ago | ffb7a6fc
gold_trainer.html | 166 kB | about 1 month ago | c32e1e9a
gold_trainer.md | 18.1 kB | about 1 month ago | 7c9df577
grpo_trainer.html | 560 kB | about 1 month ago | 5eb7cea1
grpo_trainer.md | 55.8 kB | about 1 month ago | 1c60829f
grpo_with_replay_buffer.html | 89.6 kB | about 1 month ago | 0817831d
grpo_with_replay_buffer.md | 6.16 kB | about 1 month ago | 5c67fe83
gspo_token.html | 50.2 kB | about 1 month ago | bd23614d
gspo_token.md | 4.27 kB | about 1 month ago | 5a723188
index.html | 29 kB | about 1 month ago | 17025992
index.md | 3.66 kB | about 1 month ago | 0c94ab19
installation.html | 13.9 kB | about 1 month ago | cb036430
installation.md | 721 Bytes | about 1 month ago | 93a212e0
jobs_training.html | 32.9 kB | about 1 month ago | 293562e9
jobs_training.md | 6.44 kB | about 1 month ago | 7e612e1d
kernels_hub.html | 26.8 kB | about 1 month ago | 7a69f742
kernels_hub.md | 4.18 kB | about 1 month ago | 6982a1e3
kto_trainer.html | 137 kB | about 1 month ago | 4ac8e3c3
kto_trainer.md | 18.1 kB | about 1 month ago | 435ae863
liger_kernel_integration.html | 15 kB | about 1 month ago | 021bba59
liger_kernel_integration.md | 2.2 kB | about 1 month ago | 0ab2ea33
llms-full.txt | 842 kB | about 1 month ago | ff056f6c
llms.txt | 5.08 kB | about 1 month ago | 5b04522f
lora_without_regret.html | 45 kB | about 1 month ago | 7de468ff
lora_without_regret.md | 14.4 kB | about 1 month ago | c3be6b18
merge_model_callback.html | 17.8 kB | about 1 month ago | 7e7e0655
merge_model_callback.md | 1.19 kB | about 1 month ago | 86f6b5a9
minillm_trainer.html | 179 kB | about 1 month ago | 98e6ce71
minillm_trainer.md | 16.4 kB | about 1 month ago | 6c0d830a
nash_md_trainer.html | 124 kB | about 1 month ago | 15e1e3cf
nash_md_trainer.md | 16.3 kB | about 1 month ago | 0b173706
nemo_gym.html | 58.5 kB | about 1 month ago | e6d5bf9f
nemo_gym.md | 10.3 kB | about 1 month ago | b5b12cc0
objects.inv | 2.41 kB | about 1 month ago | 21e855f8
online_dpo_trainer.html | 179 kB | about 1 month ago | fc613329
online_dpo_trainer.md | 23.1 kB | about 1 month ago | 04a915df
openenv.html | 114 kB | about 1 month ago | 516dac3d
openenv.md | 27.8 kB | about 1 month ago | 8fdd9dc4
orpo_trainer.html | 127 kB | about 1 month ago | e31bc6e0
orpo_trainer.md | 17.7 kB | about 1 month ago | 559bd659
paper_index.html | 734 kB | about 1 month ago | 50ebb27f
paper_index.md | 19.1 kB | about 1 month ago | ffd1dcbe
papo_trainer.html | 111 kB | about 1 month ago | a72b5d16
papo_trainer.md | 8.82 kB | about 1 month ago | d6e1d13c
peft_integration.html | 131 kB | about 1 month ago | dc6010e2
peft_integration.md | 23.9 kB | about 1 month ago | 947af9d1
ppo_trainer.html | 278 kB | about 1 month ago | b315a699
ppo_trainer.md | 41.4 kB | about 1 month ago | b0768695
prm_trainer.html | 117 kB | about 1 month ago | 991cc825
prm_trainer.md | 13.9 kB | about 1 month ago | 8b1416a8
ptt_integration.html | 29.3 kB | about 1 month ago | 480dc308
ptt_integration.md | 4.27 kB | about 1 month ago | 6a21357a
quickstart.html | 37.4 kB | about 1 month ago | 3a165e5b
quickstart.md | 3.25 kB | about 1 month ago | 1c4455a3
rapidfire_integration.html | 81.1 kB | about 1 month ago | 1bcb1071
rapidfire_integration.md | 12.8 kB | about 1 month ago | 78b07e81
reducing_memory_usage.html | 52.4 kB | about 1 month ago | 26763cba
reducing_memory_usage.md | 12.3 kB | about 1 month ago | d61af82e
Last updated: May 28
Pre-warmed CDN: US, EU