Nil01002/G2B_on_CODE_MBPP_H_774_subset_wrt_W_354_BS_64_lr_2e-5_epoch10_linear_schedule Updated Dec 1, 2025
Nil01002/G2B_on_CODE_MBPP_G27B_IT_G_601_subset_wrt_W_354_BS_64_lr_2e-5_epoch10_linear_schedule Updated Dec 1, 2025
Nil01002/Llama8B_on_human_gold_paraphrased_by_G27BIT_batch256_lr1e-6_warmup0.1_10_epoch_linear_lr Updated Sep 29, 2025
Nil01002/Llama8B_on_Gemma27B_gold_DEDUP_5809_batch256_lr1e-6_warmup0.1_10_epoch_linear_lr Updated Sep 20, 2025
Nil01002/Llama8B_on_countdown_Gemma27B_wrong_batch256_lr2e-5_warmup0.1_10_epoch_linear_lr Updated Sep 19, 2025
Nil01002/Llama8B_on_countdown_Gemma27B_gold_batch256_lr2e-5_warmup0.1_10_epoch_linear_lr Updated Sep 19, 2025
Nil01002/Qwen1.5B_FF_on_countdown_Gemma27B_wrong_batch64_lr2e-5_warmup0.1_linear_lr Updated Sep 18, 2025
Nil01002/Qwen1.5B_FF_on_countdown_Gemma27B_gold_batch64_lr2e-5_warmup0.1_linear_lr Updated Sep 18, 2025
Nil01002/llama8B_FF_gsm8k_Gemma27B_wrong_5944_batch256_lr10e-6_warmup0.1_116_epoch_linear_lr Updated Sep 14, 2025
Nil01002/llama8B_FF_gsm8k_Gemma27B_gold_6913_batch256_lr10e-6_warmup0.1_10_epoch_linear_lr Updated Sep 14, 2025
Nil01002/gemma2_2B_FF_gemini_flash_gold_7114_batch256_lr10e-6_warmup0.1_max_tokens_1024 Updated Sep 14, 2025
Nil01002/qwen2.5_1.5B_FF_gemini_flash_gold_7114_batch256_lr10e-6_warmup0.1_max_tokens_1024 Updated Sep 2, 2025