YYT-t/math_math-Mistral-7B-Instruct-v0.2-rs-sample_7500_temp_1.0_gen_30_mlr5e-5 7B • Updated May 3 • 6
YYT-t/math_gsm-Mistral-7B-Instruct-v0.2-rs-sample_7500_temp_1.0_gen_30_mlr5e-5 7B • Updated May 3 • 4
YYT-t/math_gsm-Meta-Llama-3-8B-Instruct-rs-sample_7500_temp_1.0_gen_30_mlr5e-5 8B • Updated May 2 • 4
YYT-t/math_math-Meta-Llama-3-8B-Instruct-rs-sample_7500_temp_1.0_gen_30_mlr5e-5 8B • Updated May 2 • 5
YYT-t/soft_thinking_distill_gsm8k_train_results_s0_e10000000_mr8192_mp512 Viewer • Updated 2 days ago • 87.1k • 12