Bucket: hf-doc-build/doc-dev (HuggingFace Doc Builds)
Path: hf-doc-build/doc-dev/trl/pr_5547/en
185 GB
2,945,341 files
Updated about 2 hours ago
Name | Size | Uploaded | Xet hash
_app | 135 items | about 1 month ago |
_toctree.yml | 3.29 kB | about 1 month ago | 3ae9d845
async_grpo_trainer.html | 108 kB | about 1 month ago | cd7d396c
async_grpo_trainer.md | 12.3 kB | about 1 month ago | 6fb977dc
bco_trainer.html | 130 kB | about 1 month ago | 09fe814b
bco_trainer.md | 14.1 kB | about 1 month ago | 9cd92c06
bema_for_reference_model.html | 90 kB | about 1 month ago | 7658e762
bema_for_reference_model.md | 7.18 kB | about 1 month ago | fb79753b
callbacks.html | 95.8 kB | about 1 month ago | 380a8c8e
callbacks.md | 7.88 kB | about 1 month ago | 2aeb368b
chat_template_utils.html | 45.5 kB | about 1 month ago | bc4d1503
chat_template_utils.md | 5.3 kB | about 1 month ago | 472f9177
clis.html | 53.4 kB | about 1 month ago | 243fff5a
clis.md | 12.8 kB | about 1 month ago | 30321c91
community_tutorials.html | 31 kB | about 1 month ago | ee473152
community_tutorials.md | 11 kB | about 1 month ago | 076643c8
cpo_trainer.html | 144 kB | about 1 month ago | 9151355d
cpo_trainer.md | 23 kB | about 1 month ago | 4609f0a6
customization.html | 31 kB | about 1 month ago | 7e6bb39e
customization.md | 4.08 kB | about 1 month ago | 0f6393e7
data_utils.html | 62.9 kB | about 1 month ago | 622f7024
data_utils.md | 6.32 kB | about 1 month ago | c2932103
dataset_formats.html | 223 kB | about 1 month ago | a7d7abe0
dataset_formats.md | 40.2 kB | about 1 month ago | 6d251874
deepspeed_integration.html | 15.8 kB | about 1 month ago | 61824258
deepspeed_integration.md | 1.48 kB | about 1 month ago | 96460e48
distillation_trainer.html | 160 kB | about 1 month ago | e647a489
distillation_trainer.md | 12.3 kB | about 1 month ago | 44ce3224
distributing_training.html | 96.5 kB | about 1 month ago | de28e36d
distributing_training.md | 21 kB | about 1 month ago | 400cb97c
dpo_trainer.html | 232 kB | about 1 month ago | 5b8ea185
dpo_trainer.md | 30.8 kB | about 1 month ago | 03bb839c
example_overview.html | 42.7 kB | about 1 month ago | 78028cca
example_overview.md | 19.7 kB | about 1 month ago | 9840e54f
experimental_overview.html | 10.8 kB | about 1 month ago | 5e33f49e
experimental_overview.md | 1.6 kB | about 1 month ago | d2f8cf27
favicon.png | 1.57 kB | about 1 month ago | 6e06dd7b
gfpo.html | 84.4 kB | about 1 month ago | 03109c69
gfpo.md | 5.24 kB | about 1 month ago | e7dd2e5f
gkd_trainer.html | 119 kB | about 1 month ago | 563378d9
gkd_trainer.md | 14.1 kB | about 1 month ago | c2f65f78
gold_trainer.html | 166 kB | about 1 month ago | dd4578d3
gold_trainer.md | 18.1 kB | about 1 month ago | b9f843ae
grpo_trainer.html | 559 kB | about 1 month ago | 667386d8
grpo_trainer.md | 55.2 kB | about 1 month ago | 1b52292b
grpo_with_replay_buffer.html | 89.6 kB | about 1 month ago | 23b0f6cf
grpo_with_replay_buffer.md | 6.16 kB | about 1 month ago | 36983621
gspo_token.html | 50.2 kB | about 1 month ago | 2600ef86
gspo_token.md | 4.27 kB | about 1 month ago | d8c6f6b8
index.html | 29 kB | about 1 month ago | 965218fb
index.md | 3.66 kB | about 1 month ago | 0c94ab19
installation.html | 13.9 kB | about 1 month ago | b880e99a
installation.md | 721 Bytes | about 1 month ago | 93a212e0
jobs_training.html | 32.9 kB | about 1 month ago | d888f860
jobs_training.md | 6.44 kB | about 1 month ago | 7e612e1d
kernels_hub.html | 26.8 kB | about 1 month ago | b646a5fd
kernels_hub.md | 4.18 kB | about 1 month ago | 6982a1e3
kto_trainer.html | 142 kB | about 1 month ago | de3c801e
kto_trainer.md | 18.4 kB | about 1 month ago | c69e2963
liger_kernel_integration.html | 15 kB | about 1 month ago | 30d8ddf2
liger_kernel_integration.md | 2.2 kB | about 1 month ago | 0ab2ea33
llms-full.txt | 836 kB | about 1 month ago | 72a97d2a
llms.txt | 5 kB | about 1 month ago | 6227fbb1
lora_without_regret.html | 45 kB | about 1 month ago | 48f4ab73
lora_without_regret.md | 14.4 kB | about 1 month ago | c3be6b18
merge_model_callback.html | 17.8 kB | about 1 month ago | 2ef6da29
merge_model_callback.md | 1.19 kB | about 1 month ago | dba06b3a
minillm_trainer.html | 179 kB | about 1 month ago | f4f61ba8
minillm_trainer.md | 16.4 kB | about 1 month ago | 660582ca
nash_md_trainer.html | 125 kB | about 1 month ago | 93782edd
nash_md_trainer.md | 16.3 kB | about 1 month ago | bcd823b9
nemo_gym.html | 58.5 kB | about 1 month ago | fd00c6c1
nemo_gym.md | 10.3 kB | about 1 month ago | b5b12cc0
objects.inv | 2.41 kB | about 1 month ago | fe63106b
online_dpo_trainer.html | 181 kB | about 1 month ago | d8074e3c
online_dpo_trainer.md | 23.1 kB | about 1 month ago | 66792dc0
openenv.html | 114 kB | about 1 month ago | 0b6c3b39
openenv.md | 27.8 kB | about 1 month ago | a6d6df3f
orpo_trainer.html | 127 kB | about 1 month ago | eb45c49f
orpo_trainer.md | 17.7 kB | about 1 month ago | 7a7569b0
paper_index.html | 734 kB | about 1 month ago | 2c1000a0
paper_index.md | 19.1 kB | about 1 month ago | 44492bb4
papo_trainer.html | 111 kB | about 1 month ago | 1e3c20a7
papo_trainer.md | 8.82 kB | about 1 month ago | 16f7c5e4
peft_integration.html | 131 kB | about 1 month ago | 1593d6a2
peft_integration.md | 23.9 kB | about 1 month ago | c41939c8
ppo_trainer.html | 278 kB | about 1 month ago | 89879b91
ppo_trainer.md | 41.4 kB | about 1 month ago | 98f2b61a
prm_trainer.html | 117 kB | about 1 month ago | f1dea9a4
prm_trainer.md | 13.9 kB | about 1 month ago | 7d2dc4db
ptt_integration.html | 29.3 kB | about 1 month ago | e5ed64e5
ptt_integration.md | 4.27 kB | about 1 month ago | 6a21357a
quickstart.html | 37.4 kB | about 1 month ago | f0bf3af5
quickstart.md | 3.25 kB | about 1 month ago | 1c4455a3
rapidfire_integration.html | 81.1 kB | about 1 month ago | 4acab1b6
rapidfire_integration.md | 12.8 kB | about 1 month ago | 78b07e81
reducing_memory_usage.html | 52.4 kB | about 1 month ago | e5dedf21
reducing_memory_usage.md | 12.3 kB | about 1 month ago | d61af82e
reward_trainer.html | 175 kB | about 1 month ago | 45c15c2b
reward_trainer.md | 21.5 kB | about 1 month ago | be03a46b
Total size: 185 GB
Files: 2,945,341
Last updated: May 28
Pre-warmed CDN: US, EU