anmolagarwal999/Qwen2.5-3B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_30 Text Generation • 3B • Updated May 6, 2025 • 1
anmolagarwal999/Qwen2.5-3B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_10 Text Generation • 3B • Updated May 6, 2025
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_410 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_390 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_378 Text Generation • 0.5B • Updated May 6, 2025
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_360 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_340 Text Generation • 0.5B • Updated May 6, 2025 • 3
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_330 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_310 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_294 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_280 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_260 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_250 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_230 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_210 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_190 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_170 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_160 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_140 Text Generation • 0.5B • Updated May 6, 2025
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_126 Text Generation • 0.5B • Updated May 6, 2025 • 1
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_110 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_90 Text Generation • 0.5B • Updated May 6, 2025
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_80 Text Generation • 0.5B • Updated May 6, 2025
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_60 Text Generation • 0.5B • Updated May 6, 2025
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_42 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_30 Text Generation • 0.5B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_10 Text Generation • 0.5B • Updated May 6, 2025
anmolagarwal999/babel-sft_run_Qwen_Qwen2.5-0.5B-Instruct_4300_256_1_10_20-global_step_90 0.6B • Updated Apr 25, 2025 • 1
anmolagarwal999/babel-sft_run_Qwen_Qwen2.5-0.5B-Instruct_4300_256_1_10_20-global_step_85 0.6B • Updated Apr 25, 2025 • 1
anmolagarwal999/babel-sft_run_Qwen_Qwen2.5-0.5B-Instruct_4300_256_1_10_20-global_step_80 0.6B • Updated Apr 25, 2025 • 1