mlfoundations-dev/small_subset_clean_sandboxes
Updated • 20
mlfoundations-dev/clean-sandboxes-tasks
Updated • 141
mlfoundations-dev/tbench_oracle_solutions
Viewer
• Updated • 233 • 5
mlfoundations-dev/sandboxes-tasks-hello-world
mlfoundations-dev/test-tbench-traces
Viewer
• Updated • 7.8k • 55
mlfoundations-dev/phi_27K_qwq_3K_lambda_0.9
Viewer
• Updated • 30k • 4
mlfoundations-dev/phi_30K_qwq_0K_temp2
Viewer
• Updated • 30k • 58
mlfoundations-dev/e1_science_ms_phi_temp2
Viewer
• Updated • 31.6k • 66
mlfoundations-dev/e1_code_fasttext_phi_temp2
Viewer
• Updated • 30.4k • 71
mlfoundations-dev/tbench_traces_local_sharegptv1
Viewer
• Updated • 1.19k • 74
mlfoundations-dev/terminal-bench-traces-local
Viewer
• Updated • 1.19k • 108
mlfoundations-dev/tbench_traces_sharegptv1
Viewer
• Updated • 7.8k • 71
• 1
mlfoundations-dev/e1_math_all_phi_temp4
Viewer
• Updated • 31.6k • 75
mlfoundations-dev/d1_code_load_in_phi_temp_2
Viewer
• Updated • 63.2k • 73
mlfoundations-dev/d1_science_load_in_phi_temp2
Viewer
• Updated • 63.2k • 60
mlfoundations-dev/claude_3_7_20250219_tbench_traces_sharegptv1
Viewer
• Updated • 820 • 66
mlfoundations-dev/claude_3_7_tbench_traces_sharegptv1
Viewer
• Updated • 74 • 66
mlfoundations-dev/claude_3_7_tbench_traces_sharegpt
Viewer
• Updated • 74 • 64
mlfoundations-dev/e1_math_all_phi_temp2
Viewer
• Updated • 31.6k • 55
mlfoundations-dev/d1_math_load_in_phi_temp4
Viewer
• Updated • 63.2k • 68
mlfoundations-dev/d1_math_load_in_phi_temp2
Viewer
• Updated • 63.2k • 65
mlfoundations-dev/claude_3_7_tbench_traces
Viewer
• Updated • 71 • 55
mlfoundations-dev/OpenReasoning-Nemotron-1.5B_eval_8179
Viewer
• Updated • 12.2k • 117
mlfoundations-dev/OpenReasoning-Nemotron-7B_eval_8179
Viewer
• Updated • 12.2k • 175
mlfoundations-dev/e1_math_all_phi_temp40
Viewer
• Updated • 31.6k • 48
mlfoundations-dev/e1_math_all_phi_temp20
Viewer
• Updated • 31.6k • 46
mlfoundations-dev/e1_math_all_phi_temp10
Viewer
• Updated • 31.6k • 49
mlfoundations-dev/d1_science_load_in_phi_temp40
Viewer
• Updated • 63.2k • 54
mlfoundations-dev/d1_science_load_in_phi_temp20
Viewer
• Updated • 63.2k • 51
mlfoundations-dev/d1_science_load_in_phi_temp10
Viewer
• Updated • 63.2k • 51