anmolagarwal999/Qwen2.5-3B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_30
Text Generation
• 3B • Updated
• 2
anmolagarwal999/Qwen2.5-3B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_10
Text Generation
• 3B • Updated
• 1
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_410
Text Generation
• 0.5B • Updated
• 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_390
Text Generation
• 0.5B • Updated
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_378
Text Generation
• 0.5B • Updated
• 5
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_360
Text Generation
• 0.5B • Updated
• 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_340
Text Generation
• 0.5B • Updated
• 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_330
Text Generation
• 0.5B • Updated
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_310
Text Generation
• 0.5B • Updated
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_294
Text Generation
• 0.5B • Updated
• 2
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_280
Text Generation
• 0.5B • Updated
• 1
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_260
Text Generation
• 0.5B • Updated
• 1
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_250
Text Generation
• 0.5B • Updated
• 1
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_230
Text Generation
• 0.5B • Updated
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_210
Text Generation
• 0.5B • Updated
• 4
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_190
Text Generation
• 0.5B • Updated
• 3
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_170
Text Generation
• 0.5B • Updated
• 1
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_160
Text Generation
• 0.5B • Updated
• 1
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_140
Text Generation
• 0.5B • Updated
• 6
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_126
Text Generation
• 0.5B • Updated
• 4
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_110
Text Generation
• 0.5B • Updated
• 4
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_90
Text Generation
• 0.5B • Updated
• 5
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_80
Text Generation
• 0.5B • Updated
• 5
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_60
Text Generation
• 0.5B • Updated
• 5
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_42
Text Generation
• 0.5B • Updated
• 1
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_30
Text Generation
• 0.5B • Updated
• 1
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_10
Text Generation
• 0.5B • Updated
• 5
anmolagarwal999/babel-sft_run_Qwen_Qwen2.5-0.5B-Instruct_4300_256_1_10_20-global_step_90
0.6B • Updated
anmolagarwal999/babel-sft_run_Qwen_Qwen2.5-0.5B-Instruct_4300_256_1_10_20-global_step_85
0.6B • Updated
anmolagarwal999/babel-sft_run_Qwen_Qwen2.5-0.5B-Instruct_4300_256_1_10_20-global_step_80
0.6B • Updated