mlfoundations-dev/instruction_filtering_fast_text_classifier_seed_science_pos_scp
Updated
mlfoundations-dev/instruction_filtering_fast_text_classifier_seed_science_pos_wikipedia_biology
Updated
mlfoundations-dev/instruction_filtering_fast_text_classifier_seed_science_pos_fineweb
Updated
mlfoundations-dev/instruction_filtering_fast_text_classifier_seed_science_pos_expertqa
Updated
mlfoundations-dev/instruction_filtering_fast_text_classifier_seed_science_pos_anderson_chemistry
Updated
mlfoundations-dev/DCFT-filter_math_neg_s1_1_gemini-etash
8B • Updated • 1
mlfoundations-dev/DCFT-swe_gym_annotate_with_patch-etash
8B • Updated • 1
mlfoundations-dev/Qwen2.5-7B-stratos_verified_mix-megatron
8B • Updated
mlfoundations-dev/global_batchsize_512_lradjusted8_warmup05
Text Generation
• 8B • Updated • 1
mlfoundations-dev/DCFT-science_verification_scale_up_random_fix-etash
8B • Updated • 1
mlfoundations-dev/DCFT-filter_code_pos_majority-etash
8B • Updated • 1
mlfoundations-dev/DCFT-filter_code_pos_codeforces_gpt-etash
8B • Updated • 1
mlfoundations-dev/DCFT-filter_code_baseline-etash
8B • Updated • 1
mlfoundations-dev/weight_decay_10
Text Generation
• 8B • Updated
mlfoundations-dev/weight_decay_05
Text Generation
• 8B • Updated
mlfoundations-dev/weight_decay_02
Text Generation
• 8B • Updated
mlfoundations-dev/weight_decay_15
Text Generation
• 8B • Updated
mlfoundations-dev/DCFT-ot-114k_general-thought-feb-25-etash
8B • Updated • 1
mlfoundations-dev/hero_run_2_fix_conversations_32b
Updated
mlfoundations-dev/Bespoke-Stratos-17k
Text Generation
• 8B • Updated • 3
mlfoundations-dev/DCFT-general-thought-feb-25-etash
8B • Updated
mlfoundations-dev/DCFT-science_verification_scale_up_random-etash
8B • Updated
mlfoundations-dev/DCFT-swe_gym_annotate-etash
8B • Updated
mlfoundations-dev/DCFT-science_verification_scale_up_majority_consensus-etash
8B • Updated
mlfoundations-dev/DCFT-science_verification_scale_up_gpt_verification-etash
8B • Updated • 1
mlfoundations-dev/DCFT-science_verification_scale_up_all-etash
8B • Updated
mlfoundations-dev/qwen_lawma_filtered_deepseek-2k-5x
Text Generation
• 8B • Updated • 2
mlfoundations-dev/hero_run_2_fix_conversations
Text Generation
• 8B • Updated
mlfoundations-dev/DCFT-seed_code_r1_glaive-etash
8B • Updated • 1
mlfoundations-dev/qwen2-5_multiple_samples_ground_truth_openr1_llm_verifier_clean
Text Generation
• 0.5B • Updated • 6