Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Buckets:
hf-doc-build
/
doc-dev
Follow
HuggingFace Doc Builds
34
Files
xet
hf-doc-build/doc-dev
/
trl
/
pr_5642
/
en
190 GB
3,011,154 files
Updated about 3 hours ago
Ctrl+K
Name
Size
Uploaded
Xet hash
_app
about 1 month ago
152 items
_toctree.yml
3.38 kB
xet
about 1 month ago
9da8f3eb
async_grpo_trainer.html
108 kB
xet
about 1 month ago
bc332607
async_grpo_trainer.md
12.3 kB
xet
about 1 month ago
4af6943d
bco_trainer.html
130 kB
xet
about 1 month ago
a873229e
bco_trainer.md
14.1 kB
xet
about 1 month ago
fedb7e3d
bema_for_reference_model.html
90 kB
xet
about 1 month ago
e30412a9
bema_for_reference_model.md
7.18 kB
xet
about 1 month ago
23a5f9d6
callbacks.html
95.8 kB
xet
about 1 month ago
a4c89fb3
callbacks.md
7.88 kB
xet
about 1 month ago
961ca704
chat_template_utils.html
46.2 kB
xet
about 1 month ago
340b3654
chat_template_utils.md
5.77 kB
xet
about 1 month ago
06fc128b
chat_templates.html
35.4 kB
xet
about 1 month ago
b7cd8177
chat_templates.md
6.48 kB
xet
about 1 month ago
d14cfb86
clis.html
53.4 kB
xet
about 1 month ago
2cddee3a
clis.md
12.8 kB
xet
about 1 month ago
3eb0fd7d
community_tutorials.html
31 kB
xet
about 1 month ago
3f4c241c
community_tutorials.md
11 kB
xet
about 1 month ago
c137803e
cpo_trainer.html
144 kB
xet
about 1 month ago
1a947d26
cpo_trainer.md
23 kB
xet
about 1 month ago
5cda1242
customization.html
31 kB
xet
about 1 month ago
8f6ccfca
customization.md
4.08 kB
xet
about 1 month ago
779b7678
data_utils.html
62.7 kB
xet
about 1 month ago
58ea8db7
data_utils.md
6.87 kB
xet
about 1 month ago
b9f7793a
dataset_formats.html
223 kB
xet
about 1 month ago
3291e066
dataset_formats.md
40.2 kB
xet
about 1 month ago
380d4c62
deepspeed_integration.html
15.8 kB
xet
about 1 month ago
8cc454f1
deepspeed_integration.md
1.48 kB
xet
about 1 month ago
96460e48
distillation_trainer.html
161 kB
xet
about 1 month ago
e7cf2003
distillation_trainer.md
12.3 kB
xet
about 1 month ago
26b8824d
distributing_training.html
96.5 kB
xet
about 1 month ago
77396dc2
distributing_training.md
21 kB
xet
about 1 month ago
400cb97c
dpo_trainer.html
233 kB
xet
about 1 month ago
4088277a
dpo_trainer.md
30.8 kB
xet
about 1 month ago
68400a15
example_overview.html
42.8 kB
xet
about 1 month ago
b5226e75
example_overview.md
19.7 kB
xet
about 1 month ago
0e24a4b0
experimental_overview.html
10.8 kB
xet
about 1 month ago
686d8516
experimental_overview.md
1.6 kB
xet
about 1 month ago
d2f8cf27
favicon.png
1.57 kB
xet
about 1 month ago
6e06dd7b
gfpo.html
85 kB
xet
about 1 month ago
0aea82b3
gfpo.md
5.24 kB
xet
about 1 month ago
24005932
gkd_trainer.html
119 kB
xet
about 1 month ago
9665e89f
gkd_trainer.md
14.1 kB
xet
about 1 month ago
0b4136af
gold_trainer.html
166 kB
xet
about 1 month ago
f569620e
gold_trainer.md
18.1 kB
xet
about 1 month ago
d9fbf5be
grpo_trainer.html
561 kB
xet
about 1 month ago
fcf3062d
grpo_trainer.md
55.9 kB
xet
about 1 month ago
f19e50ae
grpo_with_replay_buffer.html
90.3 kB
xet
about 1 month ago
97611570
grpo_with_replay_buffer.md
6.16 kB
xet
about 1 month ago
3c56b073
gspo_token.html
50.2 kB
xet
about 1 month ago
ecf0a2bb
gspo_token.md
4.27 kB
xet
about 1 month ago
7611db21
index.html
29 kB
xet
about 1 month ago
e96729ff
index.md
3.66 kB
xet
about 1 month ago
0c94ab19
installation.html
13.9 kB
xet
about 1 month ago
1e32443b
installation.md
721 Bytes
xet
about 1 month ago
93a212e0
jobs_training.html
32.9 kB
xet
about 1 month ago
f4cdaf2b
jobs_training.md
6.44 kB
xet
about 1 month ago
7e612e1d
kernels_hub.html
26.8 kB
xet
about 1 month ago
3db02d6e
kernels_hub.md
4.18 kB
xet
about 1 month ago
6982a1e3
kto_trainer.html
138 kB
xet
about 1 month ago
765256f7
kto_trainer.md
18.3 kB
xet
about 1 month ago
23da0afa
liger_kernel_integration.html
15 kB
xet
about 1 month ago
72eed58c
liger_kernel_integration.md
2.2 kB
xet
about 1 month ago
0ab2ea33
llms-full.txt
868 kB
xet
about 1 month ago
cd247930
llms.txt
5.15 kB
xet
about 1 month ago
6a6e1693
lora_without_regret.html
45 kB
xet
about 1 month ago
e6fd7cb6
lora_without_regret.md
14.4 kB
xet
about 1 month ago
c3be6b18
merge_model_callback.html
17.8 kB
xet
about 1 month ago
46709609
merge_model_callback.md
1.19 kB
xet
about 1 month ago
3e41a652
minillm_trainer.html
180 kB
xet
about 1 month ago
03d65f72
minillm_trainer.md
16.4 kB
xet
about 1 month ago
8ac5eed0
nash_md_trainer.html
125 kB
xet
about 1 month ago
7d449e90
nash_md_trainer.md
16.3 kB
xet
about 1 month ago
ea85fca2
nemo_gym.html
58.5 kB
xet
about 1 month ago
18afebf1
nemo_gym.md
10.3 kB
xet
about 1 month ago
b5b12cc0
objects.inv
2.45 kB
xet
about 1 month ago
d6a99119
online_dpo_trainer.html
180 kB
xet
about 1 month ago
c310d20c
online_dpo_trainer.md
23.1 kB
xet
about 1 month ago
6007d98f
openenv.html
114 kB
xet
about 1 month ago
7d4004ca
openenv.md
27.8 kB
xet
about 1 month ago
c6910c62
orpo_trainer.html
127 kB
xet
about 1 month ago
abe71d03
orpo_trainer.md
17.7 kB
xet
about 1 month ago
5d5494c4
paper_index.html
742 kB
xet
about 1 month ago
4e09de42
paper_index.md
19.1 kB
xet
about 1 month ago
593d08aa
papo_trainer.html
112 kB
xet
about 1 month ago
a379355a
papo_trainer.md
8.82 kB
xet
about 1 month ago
86a16afe
peft_integration.html
131 kB
xet
about 1 month ago
4fc34420
peft_integration.md
23.9 kB
xet
about 1 month ago
e9e89590
ppo_trainer.html
279 kB
xet
about 1 month ago
be81b8e3
ppo_trainer.md
41.4 kB
xet
about 1 month ago
03f52e51
prm_trainer.html
118 kB
xet
about 1 month ago
a18a411e
prm_trainer.md
13.9 kB
xet
about 1 month ago
690ef528
ptt_integration.html
29.3 kB
xet
about 1 month ago
21304933
ptt_integration.md
4.27 kB
xet
about 1 month ago
6a21357a
quickstart.html
37.4 kB
xet
about 1 month ago
c22997bf
quickstart.md
3.25 kB
xet
about 1 month ago
1c4455a3
rapidfire_integration.html
95.4 kB
xet
about 1 month ago
41acfe92
rapidfire_integration.md
17.7 kB
xet
about 1 month ago
08fed5af
reducing_memory_usage.html
52.4 kB
xet
about 1 month ago
4fb2a466
reducing_memory_usage.md
12.3 kB
xet
about 1 month ago
d61af82e
Load more
Sync this bucket
Mount this bucket
Total size
190 GB
Files
3,011,154
Last updated
May 30
Pre-warmed CDN
US
EU
US
EU
Contributors