CL-From-Nothing/grpo_code_hard_code_rose_sft_25K_warmup-parquet_qwen3-1.7b_epoch_1_mask_resp16384-T1.0-n8 2B • Updated 5 days ago • 19
CL-From-Nothing/rl_warm_up_code_rose_sft_25K_1_7B_SFT-parquet_qwen3-1.7b-base-code-10k-sft_epoch_1_mask_lr1e-5 2B • Updated 6 days ago • 10
CL-From-Nothing/code_rose_initial_1_7B_SFT_10K_rollouts_Qwen3-4B-Thinking-2507_k12_t0.7_maxtok12288 Viewer • Updated 7 days ago • 87k • 41
CL-From-Nothing/sft_warmup_OpenCodeReasoning_10K-parquet_qwen3-1.7b-base_epoch_1_mask_lr1e-5 2B • Updated 9 days ago • 12
CL-From-Nothing/grpo_pope_rlve_qwen3-1.7b_step_112_resp16384-T1.0-n8 Text Generation • 2B • Updated 11 days ago • 14