Ctrl+K
- clip_0.2_grpo_r8
- clip_0.2_ppo
- clip_0_4_grpo_r8
- clip_lr_2e-6_0.2_grpo_r8
- deepscaler_qwen3_1p7b_p-diff-adv
- kl_0_lr_1e6_clip_4_p-diff-adv
- lambda_2_p-diff-adv
- lr_1e6_clip_35_temp_1_1_p-diff-adv
- lr_1e6_kl_002_p-diff-adv
- lr_2e-7_clip_0_2_k_4_p-diff-adv-pass-at-k
- lr_5e-7_clip_0_2_k_4_p-diff-adv-pass-at-k
- lr_5e-7_clip_0_2_lambda_1_p-diff-adv
- lr_5e-7_clip_0_4_lambda_1_p-diff-adv
- lr_7e-7_clip_0_2_k_4_p-diff-adv-pass-at-k
- lr_7e7_kl_001_p-diff-adv
- mini_batch_1
- mini_batch_1_pass_at_4.0_p-diff-adv-pass-at-k
- p-diff-lambda-0
- pvalue
- ref_20_lr_1e-6_clip_0_4_lambda_1_p-diff-adv
- resume_lr_5e-7_clip_2_p-diff-adv
- resume_lr_5e-7_p-diff-adv
- temp_0_6_kl_005_clip_2_p-diff-adv
- 16.6 kB