AI & ML interests
None defined yet.
test-gen/code_livecodebench_qwen2.5-3b_t0.1_n8_tests_livecodebench_qwen3-32b_t0.6_n1
Viewer
• Updated
• 182 • 7
test-gen/code_livecodebench_qwen2.5-3b_t0.1_n8_tests_livecodebench_qwen3-14b_t0.6_n1
Viewer
• Updated
• 182 • 6
test-gen/code_livecodebench_qwen2.5-3b_t0.1_n8_tests_livecodebench_qwen3-8b_t0.6_n1
Viewer
• Updated
• 182 • 7
test-gen/code_livecodebench_qwen2.5-3b_t0.1_n8_tests_livecodebench_qwen3-4b_t0.6_n1
Viewer
• Updated
• 182 • 6
test-gen/code_livecodebench_qwen2.5-3b_t0.1_n8_tests_livecodebench_qwen3-0.6b_t0.6_n1
Viewer
• Updated
• 182 • 6
test-gen/mbpp_Qwen3-8B_t0.6_n1_think_generated_tests
Viewer
• Updated
• 500 • 5
test-gen/mbpp_Qwen3-4B_t0.6_n1_think_generated_tests
Viewer
• Updated
• 500 • 5
test-gen/mbpp_Qwen3-0.6B_t0.6_n1_think_generated_tests
Viewer
• Updated
• 500 • 5
test-gen/livecodebench_Qwen3-32B_t0.6_n1_think_generated_tests
Viewer
• Updated
• 182 • 6
test-gen/livecodebench_Qwen3-14B_t0.6_n1_think_generated_tests
Viewer
• Updated
• 182 • 6
test-gen/livecodebench_Qwen3-8B_t0.6_n1_think_generated_tests
Viewer
• Updated
• 182 • 6
test-gen/code_mbpp_qwen2.5-3b_t0.1_n8_tests_mbpp_qwen2.5-32b_t0.0_n1
Viewer
• Updated
• 500 • 6
test-gen/code_mbpp_qwen2.5-3b_t0.1_n8_tests_mbpp_qwen2.5-14b_t0.0_n1
Viewer
• Updated
• 500 • 6
test-gen/livecodebench_Qwen3-4B_t0.6_n1_think_generated_tests
Viewer
• Updated
• 182 • 5
test-gen/livecodebench_Qwen3-0.6B_t0.6_n1_think_generated_tests
Viewer
• Updated
• 182 • 6
test-gen/code_livecodebench_qwen2.5-3b_t0.1_n8_tests_livecodebench_qwen2.5-32b_t0.0_n1
Viewer
• Updated
• 182 • 7
test-gen/code_mbpp_qwen2.5-3b_t0.1_n8_tests_mbpp_qwen2.5-7b_t0.0_n1
Viewer
• Updated
• 500 • 6
test-gen/code_mbpp_qwen2.5-3b_t0.1_n8_tests_mbpp_qwen2.5-3b_t0.0_n1
Viewer
• Updated
• 500 • 6
test-gen/code_mbpp_qwen2.5-3b_t0.1_n8_tests_mbpp_qwen2.5-1.5b_t0.0_n1
Viewer
• Updated
• 500 • 6
test-gen/code_mbpp_qwen2.5-3b_t0.1_n8_tests_mbpp_qwen2.5-0.5b_t0.0_n1
Viewer
• Updated
• 500 • 6
test-gen/code_humaneval_qwen2.5-3b_t0.1_n8_tests_humaneval_qwen2.5-32b_t0.0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-3b_t0.1_n8_tests_humaneval_qwen2.5-14b_t0.0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-3b_t0.1_n8_tests_humaneval_qwen2.5-7b_t0.0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-3b_t0.1_n8_tests_humaneval_qwen2.5-3b_t0.0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-3b_t0.1_n8_tests_humaneval_qwen2.5-1.5b_t0.0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-3b_t0.1_n8_tests_humaneval_qwen2.5-0.5b_t0.0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_livecodebench_qwen2.5-3b_t0.1_n8_tests_livecodebench_qwen2.5-14b_t0.0_n1
Viewer
• Updated
• 182 • 6
test-gen/code_livecodebench_qwen2.5-3b_t0.1_n8_tests_livecodebench_qwen2.5-7b_t0.0_n1
Viewer
• Updated
• 182 • 7
test-gen/code_livecodebench_qwen2.5-3b_t0.1_n8_tests_livecodebench_qwen2.5-3b_t0.0_n1
Viewer
• Updated
• 182 • 7
test-gen/code_livecodebench_qwen2.5-3b_t0.1_n8_tests_livecodebench_qwen2.5-1.5b_t0.0_n1
Viewer
• Updated
• 182 • 6