Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Buckets:
hf-doc-build
/
doc-dev
Follow
HuggingFace Doc Builds
34
Files
xet
hf-doc-build/doc-dev
/
trl
/
pr_5320
/
en
190 GB
3,006,032 files
Updated about 2 hours ago
Ctrl+K
Name
Size
Uploaded
Xet hash
_app
about 2 months ago
139 items
_toctree.yml
3.22 kB
xet
about 2 months ago
ca36b740
async_grpo_trainer.html
109 kB
xet
about 2 months ago
b0e7c7b2
async_grpo_trainer.md
12.3 kB
xet
about 2 months ago
b084b59f
bco_trainer.html
131 kB
xet
about 2 months ago
b22be4ff
bco_trainer.md
14.4 kB
xet
about 2 months ago
69449aba
bema_for_reference_model.html
90 kB
xet
about 2 months ago
1dca992b
bema_for_reference_model.md
7.18 kB
xet
about 2 months ago
7c1cd464
callbacks.html
95.8 kB
xet
about 2 months ago
fe0198fc
callbacks.md
7.88 kB
xet
about 2 months ago
61b4dfa4
chat_template_utils.html
43.9 kB
xet
about 2 months ago
d15b3b9d
chat_template_utils.md
4.65 kB
xet
about 2 months ago
5a002fb7
clis.html
53.4 kB
xet
about 2 months ago
7c367581
clis.md
12.8 kB
xet
about 2 months ago
e6db15c7
community_tutorials.html
31 kB
xet
about 2 months ago
f86fa671
community_tutorials.md
11 kB
xet
about 2 months ago
f2dd5755
cpo_trainer.html
146 kB
xet
about 2 months ago
c4dbf443
cpo_trainer.md
23.2 kB
xet
about 2 months ago
54060617
customization.html
31 kB
xet
about 2 months ago
e955339e
customization.md
4.08 kB
xet
about 2 months ago
fafca3dd
data_utils.html
62.9 kB
xet
about 2 months ago
2590476e
data_utils.md
6.32 kB
xet
about 2 months ago
907d43e5
dataset_formats.html
223 kB
xet
about 2 months ago
f8537767
dataset_formats.md
40.2 kB
xet
about 2 months ago
65066422
deepspeed_integration.html
15.8 kB
xet
about 2 months ago
d62e0162
deepspeed_integration.md
1.48 kB
xet
about 2 months ago
96460e48
distributing_training.html
96.5 kB
xet
about 2 months ago
96c469c2
distributing_training.md
21 kB
xet
about 2 months ago
400cb97c
dpo_trainer.html
231 kB
xet
about 2 months ago
76e3acae
dpo_trainer.md
30.6 kB
xet
about 2 months ago
3ee9fcff
example_overview.html
39.6 kB
xet
about 2 months ago
a495506f
example_overview.md
19.5 kB
xet
about 2 months ago
365fe293
experimental_overview.html
10.8 kB
xet
about 2 months ago
af2fa072
experimental_overview.md
1.6 kB
xet
about 2 months ago
d2f8cf27
favicon.png
1.57 kB
xet
about 2 months ago
6e06dd7b
gfpo.html
84.4 kB
xet
about 2 months ago
d4b99e6a
gfpo.md
5.24 kB
xet
about 2 months ago
6881e2ed
gkd_trainer.html
119 kB
xet
about 2 months ago
34164cba
gkd_trainer.md
14.1 kB
xet
about 2 months ago
06140d81
gold_trainer.html
156 kB
xet
about 2 months ago
49b133ba
gold_trainer.md
16.8 kB
xet
about 2 months ago
35797af1
grpo_trainer.html
554 kB
xet
about 2 months ago
20135ecd
grpo_trainer.md
53.8 kB
xet
about 2 months ago
66f2aab3
grpo_with_replay_buffer.html
89.6 kB
xet
about 2 months ago
ed56d410
grpo_with_replay_buffer.md
6.16 kB
xet
about 2 months ago
54b19439
gspo_token.html
50.2 kB
xet
about 2 months ago
e103189d
gspo_token.md
4.27 kB
xet
about 2 months ago
91c2a017
index.html
28.8 kB
xet
about 2 months ago
9d0b3302
index.md
3.68 kB
xet
about 2 months ago
55bdb9e3
installation.html
13.9 kB
xet
about 2 months ago
66850d1f
installation.md
721 Bytes
xet
about 2 months ago
93a212e0
jobs_training.html
32.9 kB
xet
about 2 months ago
838d867a
jobs_training.md
6.44 kB
xet
about 2 months ago
7e612e1d
judges.html
122 kB
xet
about 2 months ago
dc32f9cf
judges.md
15.8 kB
xet
about 2 months ago
ee25f07c
kernels_hub.html
26.8 kB
xet
about 2 months ago
2e1d6db7
kernels_hub.md
4.18 kB
xet
about 2 months ago
6982a1e3
kto_trainer.html
138 kB
xet
about 2 months ago
871b41f8
kto_trainer.md
19.4 kB
xet
about 2 months ago
de0342c8
liger_kernel_integration.html
15 kB
xet
about 2 months ago
68835a3c
liger_kernel_integration.md
2.2 kB
xet
about 2 months ago
0ab2ea33
llms-full.txt
809 kB
xet
about 2 months ago
5ea7b6df
llms.txt
4.91 kB
xet
about 2 months ago
77ecf77b
lora_without_regret.html
45 kB
xet
about 2 months ago
840855e2
lora_without_regret.md
14.4 kB
xet
about 2 months ago
c3be6b18
merge_model_callback.html
17.8 kB
xet
about 2 months ago
dfae77ac
merge_model_callback.md
1.19 kB
xet
about 2 months ago
9edb4b4e
minillm_trainer.html
179 kB
xet
about 2 months ago
047cb616
minillm_trainer.md
16.4 kB
xet
about 2 months ago
9badc48a
nash_md_trainer.html
130 kB
xet
about 2 months ago
cfc071c6
nash_md_trainer.md
17.1 kB
xet
about 2 months ago
9592edbc
nemo_gym.html
58.5 kB
xet
about 2 months ago
bce4906c
nemo_gym.md
10.3 kB
xet
about 2 months ago
b5b12cc0
objects.inv
2.37 kB
xet
about 2 months ago
70a2c641
online_dpo_trainer.html
195 kB
xet
about 2 months ago
e12f706e
online_dpo_trainer.md
26.3 kB
xet
about 2 months ago
cb13069a
openenv.html
88.2 kB
xet
about 2 months ago
48b0c585
openenv.md
25.4 kB
xet
about 2 months ago
c0037347
orpo_trainer.html
129 kB
xet
about 2 months ago
4966391e
orpo_trainer.md
17.9 kB
xet
about 2 months ago
a2b35124
paper_index.html
717 kB
xet
about 2 months ago
5e0c2a58
paper_index.md
13.8 kB
xet
about 2 months ago
49725c37
papo_trainer.html
111 kB
xet
about 2 months ago
8f7eb829
papo_trainer.md
8.82 kB
xet
about 2 months ago
8cd55f23
peft_integration.html
131 kB
xet
about 2 months ago
694c2622
peft_integration.md
23.9 kB
xet
about 2 months ago
502969ca
ppo_trainer.html
280 kB
xet
about 2 months ago
c566282e
ppo_trainer.md
42 kB
xet
about 2 months ago
cca23124
prm_trainer.html
117 kB
xet
about 2 months ago
69be1433
prm_trainer.md
13.9 kB
xet
about 2 months ago
7ec32728
ptt_integration.html
29.3 kB
xet
about 2 months ago
6317564d
ptt_integration.md
4.27 kB
xet
about 2 months ago
6a21357a
quickstart.html
37.4 kB
xet
about 2 months ago
0224ce80
quickstart.md
3.25 kB
xet
about 2 months ago
1c4455a3
rapidfire_integration.html
81.1 kB
xet
about 2 months ago
708205c1
rapidfire_integration.md
12.8 kB
xet
about 2 months ago
c14e6d99
reducing_memory_usage.html
52.4 kB
xet
about 2 months ago
4612ff88
reducing_memory_usage.md
12.3 kB
xet
about 2 months ago
d61af82e
reward_trainer.html
175 kB
xet
about 2 months ago
2198c812
reward_trainer.md
21.5 kB
xet
about 2 months ago
af4e3dae
Load more
Sync this bucket
Mount this bucket
Total size
190 GB
Files
3,006,032
Last updated
May 30
Pre-warmed CDN
US
EU
US
EU
Contributors