mlfoundations-dev/global_batchsize_512_lradjusted16
Text Generation
• 8B • Updated • 1
mlfoundations-dev/2k_chunk_Bespoke-Stratos-17k
8B • Updated • 1
mlfoundations-dev/global_batchsize_512_lradjusted32_constant
Text Generation
• 8B • Updated • 3
mlfoundations-dev/global_batchsize_512_lradjusted64
Text Generation
• 8B • Updated • 1
mlfoundations-dev/ds_no_offload_liger_packing_dataloader1
Text Generation
• 8B • Updated • 1
mlfoundations-dev/ds_no_offload_liger_packing_dataloader2
Text Generation
• 8B • Updated • 1
mlfoundations-dev/openthoughts_with_speedups
Text Generation
• 8B • Updated • 1
mlfoundations-dev/ds_no_offload_liger_packing_dataloader4
Text Generation
• 8B • Updated • 1
mlfoundations-dev/ds_no_offload_liger_packing_dataloader32
Text Generation
• 8B • Updated • 1
mlfoundations-dev/ds_no_offload_liger_packing_dataloader16
Text Generation
• 8B • Updated • 2
mlfoundations-dev/global_batchsize_512_lradjusted8
Text Generation
• 8B • Updated • 1
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_math_worst_5-etash
8B • Updated
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_math_worst_3-etash
8B • Updated
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_math_best_openr1math-etash
8B • Updated • 1
mlfoundations-dev/global_batchsize_512_lradjusted32_warmup05
Text Generation
• 8B • Updated • 2
mlfoundations-dev/ds_no_offload_liger_packing_zero2
Text Generation
• 8B • Updated
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_math_worst_1-etash
8B • Updated • 1
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_math_best_olympiad-etash
8B • Updated
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_math_best_mix-etash
8B • Updated
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_math_best_automath-etash
8B • Updated • 1
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_code_worst_mix_8_5_3_1-etash
8B • Updated • 3
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_code_worst_mix_5_3_1-etash
8B • Updated • 1
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_code_worst_mix_3_1-etash
8B • Updated • 1
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_code_worst_mix_12_8_5_3_1-etash
8B • Updated • 1
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_code_worst_8-etash
8B • Updated • 3
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_code_worst_5-etash
8B • Updated
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_code_worst_3-etash
8B • Updated • 1
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_code_worst_12-etash
8B • Updated • 1
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_code_best_sharegpt-etash
8B • Updated • 1
mlfoundations-dev/DCFT-pos_neg_ablation_instruction_filtering_seed_code_best_baseline-etash
8B • Updated