Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Buckets:
hf-doc-build
/
doc-dev
Follow
HuggingFace Doc Builds
34
Files
xet
hf-doc-build/doc-dev
/
trl
/
pr_5481
/
en
170 GB
2,741,484 files
Updated about 18 hours ago
Ctrl+K
Name
Size
Uploaded
Xet hash
_app
about 1 month ago
135 items
_toctree.yml
3.29 kB
xet
about 1 month ago
4ea3f42b
async_grpo_trainer.html
108 kB
xet
about 1 month ago
46a0cdeb
async_grpo_trainer.md
12.3 kB
xet
about 1 month ago
5cbae078
bco_trainer.html
130 kB
xet
about 1 month ago
de8f2254
bco_trainer.md
14.1 kB
xet
about 1 month ago
67436a85
bema_for_reference_model.html
90 kB
xet
about 1 month ago
faa498ad
bema_for_reference_model.md
7.18 kB
xet
about 1 month ago
ca7b17fe
callbacks.html
95.8 kB
xet
about 1 month ago
fc597cab
callbacks.md
7.88 kB
xet
about 1 month ago
b7e57e01
chat_template_utils.html
45.5 kB
xet
about 1 month ago
5fa51bff
chat_template_utils.md
5.26 kB
xet
about 1 month ago
906630f0
clis.html
53.4 kB
xet
about 1 month ago
f3d31644
clis.md
12.8 kB
xet
about 1 month ago
19cc442a
community_tutorials.html
31 kB
xet
about 1 month ago
3218c942
community_tutorials.md
11 kB
xet
about 1 month ago
c4e872f2
cpo_trainer.html
144 kB
xet
about 1 month ago
0576b589
cpo_trainer.md
23 kB
xet
about 1 month ago
5af8759e
customization.html
31 kB
xet
about 1 month ago
68480f63
customization.md
4.08 kB
xet
about 1 month ago
94e51167
data_utils.html
62.9 kB
xet
about 1 month ago
661875ba
data_utils.md
6.32 kB
xet
about 1 month ago
067c6e4d
dataset_formats.html
223 kB
xet
about 1 month ago
ecfdf379
dataset_formats.md
40.2 kB
xet
about 1 month ago
0fec81a9
deepspeed_integration.html
15.8 kB
xet
about 1 month ago
a20037e1
deepspeed_integration.md
1.48 kB
xet
about 1 month ago
96460e48
distributing_training.html
96.5 kB
xet
about 1 month ago
565d706c
distributing_training.md
21 kB
xet
about 1 month ago
400cb97c
dpo_trainer.html
232 kB
xet
about 1 month ago
6395c3b4
dpo_trainer.md
30.7 kB
xet
about 1 month ago
96ad7488
example_overview.html
43.2 kB
xet
about 1 month ago
0c9c0f17
example_overview.md
20.1 kB
xet
about 1 month ago
6ee150da
experimental_overview.html
10.8 kB
xet
about 1 month ago
f5a300e6
experimental_overview.md
1.6 kB
xet
about 1 month ago
d2f8cf27
favicon.png
1.57 kB
xet
about 1 month ago
6e06dd7b
gfpo.html
84.4 kB
xet
about 1 month ago
4ce62ba5
gfpo.md
5.24 kB
xet
about 1 month ago
f6249e7c
gkd_trainer.html
119 kB
xet
about 1 month ago
71d72884
gkd_trainer.md
14.1 kB
xet
about 1 month ago
75f0c195
gold_trainer.html
163 kB
xet
about 1 month ago
3abfb74f
gold_trainer.md
17.5 kB
xet
about 1 month ago
d2373003
grpo_trainer.html
559 kB
xet
about 1 month ago
4197a6a2
grpo_trainer.md
54.9 kB
xet
about 1 month ago
f88f7da1
grpo_with_replay_buffer.html
89.6 kB
xet
about 1 month ago
e8face0a
grpo_with_replay_buffer.md
6.16 kB
xet
about 1 month ago
a30113ba
gspo_token.html
50.2 kB
xet
about 1 month ago
691acc14
gspo_token.md
4.27 kB
xet
about 1 month ago
01f92f98
index.html
29 kB
xet
about 1 month ago
e991d237
index.md
3.66 kB
xet
about 1 month ago
0c94ab19
installation.html
13.9 kB
xet
about 1 month ago
b11a173c
installation.md
721 Bytes
xet
about 1 month ago
93a212e0
jobs_training.html
32.9 kB
xet
about 1 month ago
9c4372c6
jobs_training.md
6.44 kB
xet
about 1 month ago
7e612e1d
judges.html
122 kB
xet
about 1 month ago
cedbaee8
judges.md
15.8 kB
xet
about 1 month ago
4db978bf
kernels_hub.html
26.8 kB
xet
about 1 month ago
fd148876
kernels_hub.md
4.18 kB
xet
about 1 month ago
6982a1e3
kto_trainer.html
139 kB
xet
about 1 month ago
db2701e5
kto_trainer.md
17.5 kB
xet
about 1 month ago
d1800b9d
liger_kernel_integration.html
15 kB
xet
about 1 month ago
bace6f47
liger_kernel_integration.md
2.2 kB
xet
about 1 month ago
0ab2ea33
llms-full.txt
833 kB
xet
about 1 month ago
9b0b7f6d
llms.txt
5.04 kB
xet
about 1 month ago
16a636de
lora_without_regret.html
45 kB
xet
about 1 month ago
eb503693
lora_without_regret.md
14.4 kB
xet
about 1 month ago
c3be6b18
merge_model_callback.html
17.8 kB
xet
about 1 month ago
81b19000
merge_model_callback.md
1.19 kB
xet
about 1 month ago
ce44dbff
minillm_trainer.html
179 kB
xet
about 1 month ago
4ff51d84
minillm_trainer.md
16.4 kB
xet
about 1 month ago
6c1033f8
nash_md_trainer.html
130 kB
xet
about 1 month ago
95165848
nash_md_trainer.md
17.1 kB
xet
about 1 month ago
36aac54f
nemo_gym.html
58.5 kB
xet
about 1 month ago
e0dc037d
nemo_gym.md
10.3 kB
xet
about 1 month ago
b5b12cc0
objects.inv
2.45 kB
xet
about 1 month ago
5f3834b0
online_dpo_trainer.html
195 kB
xet
about 1 month ago
0d1de3b5
online_dpo_trainer.md
26.3 kB
xet
about 1 month ago
2d328936
openenv.html
114 kB
xet
about 1 month ago
3f508ab3
openenv.md
27.8 kB
xet
about 1 month ago
43609d58
orpo_trainer.html
127 kB
xet
about 1 month ago
6f7e3d2d
orpo_trainer.md
17.7 kB
xet
about 1 month ago
73f49606
paper_index.html
732 kB
xet
about 1 month ago
c847e76e
paper_index.md
17 kB
xet
about 1 month ago
8d26e47a
papo_trainer.html
111 kB
xet
about 1 month ago
d238e504
papo_trainer.md
8.82 kB
xet
about 1 month ago
1246f857
peft_integration.html
131 kB
xet
about 1 month ago
c18f4afc
peft_integration.md
23.9 kB
xet
about 1 month ago
22dc5737
ppo_trainer.html
280 kB
xet
about 1 month ago
c660368d
ppo_trainer.md
42 kB
xet
about 1 month ago
b1be0d90
prm_trainer.html
117 kB
xet
about 1 month ago
a49a2c30
prm_trainer.md
13.9 kB
xet
about 1 month ago
f469ebb9
ptt_integration.html
29.3 kB
xet
about 1 month ago
9a3ba8ee
ptt_integration.md
4.27 kB
xet
about 1 month ago
6a21357a
quickstart.html
37.4 kB
xet
about 1 month ago
3c5f386a
quickstart.md
3.25 kB
xet
about 1 month ago
1c4455a3
rapidfire_integration.html
81.1 kB
xet
about 1 month ago
0ac10e98
rapidfire_integration.md
12.8 kB
xet
about 1 month ago
c14e6d99
reducing_memory_usage.html
52.4 kB
xet
about 1 month ago
723c4837
reducing_memory_usage.md
12.3 kB
xet
about 1 month ago
d61af82e
reward_trainer.html
176 kB
xet
about 1 month ago
d1bc500a
reward_trainer.md
21.5 kB
xet
about 1 month ago
39bc688d
Load more
Sync this bucket
Mount this bucket
Total size
170 GB
Files
2,741,484
Last updated
May 23
Pre-warmed CDN
US
EU
US
EU
Contributors