mlfoundations-dev/am_0.3k_eval_08c7
• Updated • 600 • 3
mlfoundations-dev/nemo_nano_1k_eval_08c7
• Updated • 600 • 1
mlfoundations-dev/am_eval_08c7
• Updated • 600 • 2
mlfoundations-dev/am_300k_eval_08c7
• Updated • 600 • 2
mlfoundations-dev/limo_0.3k_eval_08c7
• Updated • 600 • 2
mlfoundations-dev/am_1000k_eval_08c7
• Updated • 600 • 1
mlfoundations-dev/s1_0.3k_eval_08c7
• Updated • 600 • 2
mlfoundations-dev/s1_eval_08c7
• Updated • 600 • 2
mlfoundations-dev/Llama-3.1-8B-Instruct_eval_5554
• Updated • 22.7k • 2
mlfoundations-dev/openthoughts3_science_eval_2e29
• Updated • 23.9k • 20
mlfoundations-dev/a1_science_stackexchange_physics_1k_eval_636d
• Updated • 7.9k • 7
mlfoundations-dev/b1_science_top_2_10k_eval_636d
• Updated • 7.9k • 28
mlfoundations-dev/openthoughts3_code_force_stop
• Updated • 16 • 2
mlfoundations-dev/DeepSeek-R1-0528-Qwen3-8B_eval_5554
• Updated • 22.7k • 4 • 1
mlfoundations-dev/OpenThinker-32B-Unverified_eval_5554
• Updated • 22.7k • 2
mlfoundations-dev/OpenThinker2-7B_1748464817_eval_2870
• Updated • 300 • 2
mlfoundations-dev/OpenThinker-7B-Unverified_eval_5554
• Updated • 22.7k • 2
mlfoundations-dev/eval-lawma-tasks-qwen_lawma_deepseek-2k-5x-majority_verified
• Updated • 1.52k • 30
mlfoundations-dev/verified_stratos_mix_no_proofs_without_metadata_eval_5554
• Updated • 22.7k • 4
mlfoundations-dev/AceReason-Nemotron-14B_eval_5554
• Updated • 45.4k • 2
mlfoundations-dev/OpenThinker-32B_eval_08c7
• Updated • 300 • 2
mlfoundations-dev/OpenThinker-7B_eval_08c7
• Updated • 300 • 2
mlfoundations-dev/decontamination_study
• Updated • 6.09k • 4
mlfoundations-dev/openthoughts3_math_100k_eval_2e29
• Updated • 23.9k • 2
mlfoundations-dev/meta_chat_reasoning_25_75_eval_2e29
• Updated • 23.9k • 2
mlfoundations-dev/deepmath_eval_2e29
• Updated • 23.9k • 4
mlfoundations-dev/meta_chat_reasoning_25_75_eval_636d
• Updated • 7.9k • 2
mlfoundations-dev/openthoughts3_100k_eval_636d
• Updated • 7.9k • 2
mlfoundations-dev/meta_chat_reasoning_50_50_system_eval_bba0
• Updated • 7.9k • 2
mlfoundations-dev/meta_chat_reasoning_25_75_system_eval_bba0
• Updated • 7.9k • 4