Commit History

Upload folder using huggingface_hub
913bd30
verified

rwlinno committed on

Update repo README with model card
4aaf620
verified

rwlinno committed on

Add grpo-scae-qwen35-9b (Qwen3.5-9B, GRPO+SCAE, step 949, LoRA r=64 alpha=128)
062721f
verified

rwlinno committed on

Add opd-topoprm-qwen35-9b-v2 (Qwen3.5-9B, OPD Stage3, step 50, LoRA r=64 alpha=128)
006b9a6
verified

rwlinno committed on

Add opd-topoprm-dr1-7b-v2 (DR1-7B, OPD Stage3, step 200, LoRA r=64 alpha=128)
65ca91f
verified

rwlinno committed on

Add grpo-topoprm-qwen35-9b (Qwen3.5-9B, GRPO+TopoPRM, step 50, LoRA r=64 alpha=128)
6782c17
verified

rwlinno committed on

Add grpo-topoprm-dr1-7b (DR1-7B, GRPO+TopoPRM, step 100, LoRA r=64 alpha=128)
f98a04a
verified

rwlinno committed on

Add sft-dr1-7b-final checkpoint
b95d744
verified

rwlinno committed on

Upload folder using huggingface_hub
19342f2
verified

rwlinno committed on

Upload folder using huggingface_hub
0edeb9a
verified

rwlinno committed on

Upload folder using huggingface_hub
7e4f6bd
verified

rwlinno committed on

Upload folder using huggingface_hub
c85c854
verified

rwlinno committed on

Upload folder using huggingface_hub
7952407
verified

rwlinno committed on

Upload folder using huggingface_hub
bb658e9
verified

rwlinno committed on

Upload folder using huggingface_hub
41085e7
verified

rwlinno committed on

Upload grpo_hier_9b_ckpt79/args.json
874522c
verified

rwlinno committed on

Upload grpo_hier_9b_ckpt79/additional_config.json
41fce26
verified

rwlinno committed on

Upload grpo_hier_9b_ckpt79/scheduler.pt
390e9e4
verified

rwlinno committed on

Upload grpo_hier_9b_ckpt79/adapter_config.json
0f89e53
verified

rwlinno committed on

Upload grpo_hier_9b_ckpt79/rng_state.pth
960329a
verified

rwlinno committed on

Upload grpo_hier_9b_ckpt79/adapter_model.safetensors
e0759d8
verified

rwlinno committed on

Upload grpo_hier_9b_ckpt79/training_args.bin
853d7a6
verified

rwlinno committed on

Upload grpo_hier_9b_ckpt79/trainer_state.json
f060daf
verified

rwlinno committed on

Upload grpo_hier_9b_ckpt79/optimizer.pt
6991c52
verified

rwlinno committed on

Upload grpo_topoprm_dr1_7b_ckpt949/args.json
43c70e7
verified

rwlinno committed on

Upload grpo_topoprm_dr1_7b_ckpt949/additional_config.json
24e404b
verified

rwlinno committed on

Upload grpo_topoprm_dr1_7b_ckpt949/adapter_config.json
494267a
verified

rwlinno committed on

Upload grpo_topoprm_dr1_7b_ckpt949/scheduler.pt
f7a497b
verified

rwlinno committed on

Upload grpo_topoprm_dr1_7b_ckpt949/training_args.bin
7b8285f
verified

rwlinno committed on

Upload grpo_topoprm_dr1_7b_ckpt949/rng_state.pth
465919c
verified

rwlinno committed on

Upload grpo_topoprm_dr1_7b_ckpt949/adapter_model.safetensors
530c40e
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_v2_ckpt200/trainer_state.json
a60f58b
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_v2_ckpt200/optimizer.pt
a38b74f
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_v2_ckpt200/args.json
9c57301
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_v2_ckpt200/additional_config.json
c78fd68
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_v2_ckpt200/adapter_config.json
7b113cc
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_v2_ckpt200/scheduler.pt
4ef78aa
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_v2_ckpt200/training_args.bin
8dc077f
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_v2_ckpt200/rng_state.pth
a019451
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_v2_ckpt200/adapter_model.safetensors
397061d
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_ckpt200/trainer_state.json
d8cd626
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_ckpt200/optimizer.pt
e57039c
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_ckpt200/args.json
1b0a1a5
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_ckpt200/additional_config.json
98dd346
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_ckpt200/adapter_config.json
85a4402
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_ckpt200/scheduler.pt
a60738e
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_ckpt200/training_args.bin
e512801
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_ckpt200/rng_state.pth
2015714
verified

rwlinno committed on

Upload opd_dr1_7b_stage3_ckpt200/adapter_model.safetensors
508d70f
verified

rwlinno committed on

Upload opd_qwen25_7b_stage3_ckpt200/trainer_state.json
cd3a198
verified

rwlinno committed on