Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Buckets:
hf-doc-build
/
doc-dev
Follow
HuggingFace Doc Builds
34
Files
xet
hf-doc-build/doc-dev
/
trl
/
pr_5538
/
en
190 GB
3,006,032 files
Updated about 2 hours ago
Ctrl+K
Name
Size
Uploaded
Xet hash
_app
about 1 month ago
137 items
_toctree.yml
3.38 kB
xet
about 1 month ago
9da8f3eb
async_grpo_trainer.html
108 kB
xet
about 1 month ago
885a1c04
async_grpo_trainer.md
12.3 kB
xet
about 1 month ago
753dfb82
bco_trainer.html
130 kB
xet
about 1 month ago
beeaf251
bco_trainer.md
14.1 kB
xet
about 1 month ago
e7a48bd9
bema_for_reference_model.html
90 kB
xet
about 1 month ago
dd78bb3e
bema_for_reference_model.md
7.18 kB
xet
about 1 month ago
7aac0a50
callbacks.html
95.8 kB
xet
about 1 month ago
a025e2f3
callbacks.md
7.88 kB
xet
about 1 month ago
3c2df48b
chat_template_utils.html
45.8 kB
xet
about 1 month ago
bf82d48a
chat_template_utils.md
5.48 kB
xet
about 1 month ago
02ff7878
chat_templates.html
28.3 kB
xet
about 1 month ago
fc031dda
chat_templates.md
5.13 kB
xet
about 1 month ago
40f43b10
clis.html
53.4 kB
xet
about 1 month ago
25162847
clis.md
12.8 kB
xet
about 1 month ago
5f9ee25f
community_tutorials.html
31 kB
xet
about 1 month ago
ab62dfb0
community_tutorials.md
11 kB
xet
about 1 month ago
a55680f6
cpo_trainer.html
144 kB
xet
about 1 month ago
b4dad6c1
cpo_trainer.md
23 kB
xet
about 1 month ago
15a7faa8
customization.html
31 kB
xet
about 1 month ago
e8bfe075
customization.md
4.08 kB
xet
about 1 month ago
5a84b9e8
data_utils.html
62.7 kB
xet
about 1 month ago
841db7e3
data_utils.md
6.87 kB
xet
about 1 month ago
1b21f5d4
dataset_formats.html
223 kB
xet
about 1 month ago
08278541
dataset_formats.md
40.2 kB
xet
about 1 month ago
deab9ecc
deepspeed_integration.html
15.8 kB
xet
about 1 month ago
d1799241
deepspeed_integration.md
1.48 kB
xet
about 1 month ago
96460e48
distillation_trainer.html
160 kB
xet
about 1 month ago
88813181
distillation_trainer.md
12.3 kB
xet
about 1 month ago
aaf46eb0
distributing_training.html
96.5 kB
xet
about 1 month ago
9b5746dd
distributing_training.md
21 kB
xet
about 1 month ago
400cb97c
dpo_trainer.html
232 kB
xet
about 1 month ago
b8c2a786
dpo_trainer.md
30.8 kB
xet
about 1 month ago
49320d3b
example_overview.html
42.8 kB
xet
about 1 month ago
38afa307
example_overview.md
19.7 kB
xet
about 1 month ago
b58b76ea
experimental_overview.html
10.8 kB
xet
about 1 month ago
15cb039e
experimental_overview.md
1.6 kB
xet
about 1 month ago
d2f8cf27
favicon.png
1.57 kB
xet
about 1 month ago
6e06dd7b
gfpo.html
84.4 kB
xet
about 1 month ago
7fc1ab4d
gfpo.md
5.24 kB
xet
about 1 month ago
d443a181
gkd_trainer.html
119 kB
xet
about 1 month ago
e2607b3f
gkd_trainer.md
14.1 kB
xet
about 1 month ago
43354d1a
gold_trainer.html
166 kB
xet
about 1 month ago
c34d69a1
gold_trainer.md
18.1 kB
xet
about 1 month ago
75cdab17
grpo_trainer.html
560 kB
xet
about 1 month ago
d69734f6
grpo_trainer.md
55.8 kB
xet
about 1 month ago
a51139d8
grpo_with_replay_buffer.html
89.6 kB
xet
about 1 month ago
c936f2f2
grpo_with_replay_buffer.md
6.16 kB
xet
about 1 month ago
3f1d0bf5
gspo_token.html
50.2 kB
xet
about 1 month ago
58525c24
gspo_token.md
4.27 kB
xet
about 1 month ago
4aa39892
index.html
29 kB
xet
about 1 month ago
ccba1f04
index.md
3.66 kB
xet
about 1 month ago
0c94ab19
installation.html
13.9 kB
xet
about 1 month ago
dc8d32c4
installation.md
721 Bytes
xet
about 1 month ago
93a212e0
jobs_training.html
32.9 kB
xet
about 1 month ago
c601c830
jobs_training.md
6.44 kB
xet
about 1 month ago
7e612e1d
kernels_hub.html
26.8 kB
xet
about 1 month ago
6fa8d85c
kernels_hub.md
4.18 kB
xet
about 1 month ago
6982a1e3
kto_trainer.html
137 kB
xet
about 1 month ago
7a52de74
kto_trainer.md
18.4 kB
xet
about 1 month ago
371a2933
liger_kernel_integration.html
15 kB
xet
about 1 month ago
5b669f9c
liger_kernel_integration.md
2.2 kB
xet
about 1 month ago
0ab2ea33
llms-full.txt
862 kB
xet
about 1 month ago
e0f7d1fb
llms.txt
5.15 kB
xet
about 1 month ago
de2de41d
lora_without_regret.html
45 kB
xet
about 1 month ago
2c540ebe
lora_without_regret.md
14.4 kB
xet
about 1 month ago
c3be6b18
merge_model_callback.html
17.8 kB
xet
about 1 month ago
cdc9383c
merge_model_callback.md
1.19 kB
xet
about 1 month ago
fe93ce17
minillm_trainer.html
179 kB
xet
about 1 month ago
f536233f
minillm_trainer.md
16.4 kB
xet
about 1 month ago
3518ea35
nash_md_trainer.html
124 kB
xet
about 1 month ago
6af743b0
nash_md_trainer.md
16.3 kB
xet
about 1 month ago
d4e9abc4
nemo_gym.html
58.5 kB
xet
about 1 month ago
e392b492
nemo_gym.md
10.3 kB
xet
about 1 month ago
b5b12cc0
objects.inv
2.45 kB
xet
about 1 month ago
32bea30a
online_dpo_trainer.html
179 kB
xet
about 1 month ago
2f3b164c
online_dpo_trainer.md
23.1 kB
xet
about 1 month ago
409c9bb2
openenv.html
114 kB
xet
about 1 month ago
2fd544d5
openenv.md
27.8 kB
xet
about 1 month ago
f762106b
orpo_trainer.html
127 kB
xet
about 1 month ago
f1a29380
orpo_trainer.md
17.7 kB
xet
about 1 month ago
da69ec80
paper_index.html
742 kB
xet
about 1 month ago
dd0596bf
paper_index.md
19.1 kB
xet
about 1 month ago
3acacbd1
papo_trainer.html
111 kB
xet
about 1 month ago
eed2c50b
papo_trainer.md
8.82 kB
xet
about 1 month ago
0ba4bdc7
peft_integration.html
131 kB
xet
about 1 month ago
1fd4b7f1
peft_integration.md
23.9 kB
xet
about 1 month ago
9ef65f83
ppo_trainer.html
278 kB
xet
about 1 month ago
007ea87a
ppo_trainer.md
41.4 kB
xet
about 1 month ago
720fa865
prm_trainer.html
117 kB
xet
about 1 month ago
7aa72565
prm_trainer.md
13.9 kB
xet
about 1 month ago
ecb05ef5
ptt_integration.html
29.3 kB
xet
about 1 month ago
4a921c7a
ptt_integration.md
4.27 kB
xet
about 1 month ago
6a21357a
quickstart.html
37.4 kB
xet
about 1 month ago
26882b81
quickstart.md
3.25 kB
xet
about 1 month ago
1c4455a3
rapidfire_integration.html
81.1 kB
xet
about 1 month ago
66b3b89a
rapidfire_integration.md
12.8 kB
xet
about 1 month ago
78b07e81
reducing_memory_usage.html
52.4 kB
xet
about 1 month ago
3d24df1e
reducing_memory_usage.md
12.3 kB
xet
about 1 month ago
d61af82e
Load more
Sync this bucket
Mount this bucket
Total size
190 GB
Files
3,006,032
Last updated
May 30
Pre-warmed CDN
US
EU
US
EU
Contributors