xw1234gan/Merging_Qwen2.5-7B-Instruct_MMLU_lr1e-05_mb2_ga128_n2048_seed42 Text Generation • 8B • Updated 16 days ago • 24
xw1234gan/Fixed_Merging_Qwen2.5-7B-Instruct_MMLU_lr1e-05_mb2_ga128_n2048_seed42 Text Generation • 8B • Updated 17 days ago • 27
xw1234gan/GRPO_KL_Qwen2.5-7B-Instruct_MMLU_beta0.01_lr1e-05_mb2_ga128_n2048_seed42 Text Generation • 8B • Updated 18 days ago • 25
xw1234gan/cnk12_GRPO_KL_Qwen2.5-7B-Instruct_beta0.01_lr1e-05_mb2_ga128_n2048_seed42 Text Generation • 8B • Updated 19 days ago • 25
xw1234gan/olympiads_Adaptive_Merging_Qwen2.5-1.5B-Instruct_lr1e-05_mb2_ga128_n2048_seed42 Text Generation • 2B • Updated 21 days ago • 25