AI & ML interests
None defined yet.
test-gen/code_mbpp_qwen2.5-7b_t1.0_n8_tests_mbpp_qwen3-1.7b-easy-unique_t0.0_n1
Viewer
• Updated
• 500 • 6
test-gen/code_mbpp_qwen2.5-7b_t1.0_n8_tests_mbpp_qwen3-1.7b-unique_lr1e-5_t0.0_n1
Viewer
• Updated
• 500 • 5
test-gen/code_mbpp_qwen2.5-7b_t1.0_n8_tests_mbpp_qwen3-1.7b-unique_t0.0_n1
Viewer
• Updated
• 500 • 6
test-gen/code_mbpp_qwen2.5-7b_t1.0_n8_tests_mbpp_qwen3-1.7b-easy_lr1e-5_t0.0_n1
Viewer
• Updated
• 500 • 5
test-gen/code_mbpp_qwen2.5-7b_t1.0_n8_tests_mbpp_qwen3-1.7b-easy_t0.0_n1
Viewer
• Updated
• 500 • 6
test-gen/code_mbpp_qwen2.5-7b_t1.0_n8_tests_mbpp_qwen3-0.6b-unique_lr1e-5_t0.0_n1
Viewer
• Updated
• 500 • 6
test-gen/code_mbpp_qwen2.5-7b_t1.0_n8_tests_mbpp_qwen3-0.6b-easy_t0.0_n1
Viewer
• Updated
• 500 • 6
test-gen/code_mbpp_qwen2.5-7b_t1.0_n8_tests_mbpp_qwen3-0.6b-random_lr1e-5_t0.0_n1
Viewer
• Updated
• 500 • 6
test-gen/code_mbpp_qwen2.5-7b_t1.0_n8_tests_mbpp_qwen2-0.5b-easy-unique_lr1e-6_t0.0_n1
Viewer
• Updated
• 500 • 6
test-gen/code_humaneval_qwen2.5-7b_t1.0_n8_tests_humaneval_qwen2-3b-easy-unique_lr1e-6_t0.0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-14b_t1.0_n8_tests_humaneval_qwen3-4b_t0.6_n1_think
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-14b_t1.0_n8_tests_humaneval_o4-mini_t0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-14b_t1.0_n8_tests_humaneval_o3_t0_n1
Viewer
• Updated
• 164 • 5
test-gen/code_humaneval_qwen2.5-3b_t1.0_n8_tests_humaneval_qwen-7b-easy_t0.0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-3b_t1.0_n8_tests_humaneval_qwen3-4b_t0.6_n1_think
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-3b_t1.0_n8_tests_humaneval_o4-mini_t0_n1
Viewer
• Updated
• 164 • 5
test-gen/code_humaneval_qwen2.5-3b_t1.0_n8_tests_humaneval_o3_t0_n1
Viewer
• Updated
• 164 • 5
test-gen/code_humaneval_qwen2.5-1.5b_t1.0_n8_tests_humaneval_qwen-7b-easy_t0.0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-1.5b_t1.0_n8_tests_humaneval_qwen3-4b_t0.6_n1_think
Viewer
• Updated
• 164 • 5
test-gen/code_humaneval_qwen2.5-1.5b_t1.0_n8_tests_humaneval_o4-mini_t0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-1.5b_t1.0_n8_tests_humaneval_o3_t0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-0.5b_t1.0_n8_tests_humaneval_qwen-7b-easy_t0.0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-0.5b_t1.0_n8_tests_humaneval_qwen3-4b_t0.6_n1_think
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-0.5b_t1.0_n8_tests_humaneval_o4-mini_t0_n1
Viewer
• Updated
• 164 • 5
test-gen/code_humaneval_qwen2.5-0.5b_t1.0_n8_tests_humaneval_o3_t0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_mbpp_qwen2.5-32b_t1.0_n8_tests_mbpp_qwen-7b-easy_t0.0_n1
Viewer
• Updated
• 500 • 6
test-gen/code_mbpp_qwen2.5-32b_t1.0_n8_tests_mbpp_qwen3-4b_t0.6_n1_think
Viewer
• Updated
• 500 • 5
test-gen/code_mbpp_qwen2.5-32b_t1.0_n8_tests_mbpp_o4-mini_t0_n1
Viewer
• Updated
• 500 • 6
test-gen/code_mbpp_qwen2.5-32b_t1.0_n8_tests_mbpp_o3_t0_n1
Viewer
• Updated
• 500 • 5
test-gen/code_mbpp_qwen2.5-14b_t1.0_n8_tests_mbpp_qwen-7b-easy_t0.0_n1
Viewer
• Updated
• 500 • 6