mlfoundations-dev/DCFT-S1-R1-32B
Updated
mlfoundations-dev/difficulty_sorting_easy_seed_math
Text Generation
• 8B • Updated • 5
mlfoundations-dev/difficulty_sorting_random_seed_math
Text Generation
• 8B • Updated • 3
mlfoundations-dev/difficulty_sorting_medium_seed_math
Text Generation
• 8B • Updated • 3
mlfoundations-dev/difficulty_sorting_high_seed_math
Text Generation
• 8B • Updated • 2
mlfoundations-dev/s1K_reformat_v2
Text Generation
• 8B • Updated • 1
mlfoundations-dev/LIMO_OLD
Text Generation
• 8B • Updated • 1
mlfoundations-dev/unverified_stratos_mix_no_proofs_without_metadata
Text Generation
• 8B • Updated • 2
mlfoundations-dev/verified_stratos_mix_no_proofs_without_metadata
Text Generation
• 8B • Updated • 14
mlfoundations-dev/multiple_samples_none_numina_aime
Text Generation
• 8B • Updated • 3
mlfoundations-dev/multiple_samples_sharpening_numina_aime
Text Generation
• 8B • Updated • 2
mlfoundations-dev/dpo_from_multiple_samples_shortest_numina_aime
Text Generation
• 8B • Updated • 1
mlfoundations-dev/dpo_from_stratos_judged_annotated_rejected_responses
Text Generation
• 8B • Updated • 3
• 1
mlfoundations-dev/seed_math_math_instruct_reasoninghp
Text Generation
• 8B • Updated mlfoundations-dev/seed_math_open2math_reasoninghp
Text Generation
• 8B • Updated • 7
mlfoundations-dev/seed_math_automathtext_reasoninghp
Text Generation
• 8B • Updated • 2
mlfoundations-dev/multiple_samples_majority_consensus_pick_one_numina_aime_math_verify
Text Generation
• 8B • Updated • 3
mlfoundations-dev/seed_math_tiger_math_reasoninghp
Text Generation
• 8B • Updated • 1
mlfoundations-dev/multiple_samples_majority_consensus_numina_aime_math_verify
Text Generation
• 8B • Updated • 1
mlfoundations-dev/32k_test_dummy
Text Generation
• 8B • Updated • 2
mlfoundations-dev/mlfoundations-dev_stratos-verified-mix-scaled-1_stratos_7b
Text Generation
• 8B • Updated • 4
mlfoundations-dev/mlfoundations-dev_stratos-unverified-mix-scaled-0_5_stratos_7b
Text Generation
• 8B • Updated • 1
mlfoundations-dev/OpenThinker_7B_32k
Updated
mlfoundations-dev/dolphinr1
Text Generation
• 8B • Updated • 2
• 2
mlfoundations-dev/llama3-1_8b_multiple_samples_majority_consensus_numina_aime
Text Generation
• 8B • Updated • 1
mlfoundations-dev/llama3-1_8b_multiple_samples_all_numina_aime
Text Generation
• 8B • Updated • 8
mlfoundations-dev/llama3-1_8b_multiple_samples_random_numina_aime
Text Generation
• 8B • Updated • 2
mlfoundations-dev/llama3-1_8b_multiple_samples_shortest_numina_aime
Text Generation
• 8B • Updated • 2
mlfoundations-dev/mlfoundations-dev_stratos-verified-mix-scaled-0_5_stratos_7b
Text Generation
• 8B • Updated • 3
mlfoundations-dev/llama3-1_8b_r1_annotated_math
Text Generation
• 8B • Updated • 1