LLucass
/

FF_L0.2_H0.2_dr_grpo

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

FF_L0.2_H0.2_dr_grpo / checkpoint-200

28.4 GB

Ctrl+K

Ctrl+K

1 contributor

History: 1 commit

LLucass's picture

Training in progress, step 200, checkpoint

313f6bc verified 11 months ago

global_step200
Training in progress, step 200, checkpoint 11 months ago
config.json

704 Bytes
Training in progress, step 200, checkpoint 11 months ago
generation_config.json

181 Bytes
Training in progress, step 200, checkpoint 11 months ago
latest

14 Bytes
Training in progress, step 200, checkpoint 11 months ago
model.safetensors

3.55 GB
xet

Training in progress, step 200, checkpoint 11 months ago
rng_state_0.pth

15 kB
xet

Training in progress, step 200, checkpoint 11 months ago
rng_state_1.pth

15 kB
xet

Training in progress, step 200, checkpoint 11 months ago
rng_state_2.pth

15 kB
xet

Training in progress, step 200, checkpoint 11 months ago
rng_state_3.pth

15 kB
xet

Training in progress, step 200, checkpoint 11 months ago
scheduler.pt

1.06 kB
xet

Training in progress, step 200, checkpoint 11 months ago
special_tokens_map.json

485 Bytes
Training in progress, step 200, checkpoint 11 months ago
tokenizer.json

11.4 MB
xet

Training in progress, step 200, checkpoint 11 months ago
tokenizer_config.json

6.77 kB
Training in progress, step 200, checkpoint 11 months ago
trainer_state.json

210 kB
Training in progress, step 200, checkpoint 11 months ago
training_args.bin

8.89 kB
xet

Training in progress, step 200, checkpoint 11 months ago
zero_to_fp32.py

33.3 kB
Training in progress, step 200, checkpoint 11 months ago