test-gen/code_humaneval_qwen2.5-3b_t0.1_n8_tests_humaneval_qwen3-0.6b-easy-unique_lr1e-5_t0.0_n1
• 164 • 6
test-gen/mbpp_DeepSeek-R1-Distill-Qwen-32B_t0.6_n1_think_generated_tests
• 500 • 5
test-gen/humaneval_qwen3-0.6b-easy-unique_lr1e-5_t0.0_n1_generated_tests
• 164 • 5
test-gen/code_humaneval_qwen2.5-3b_t0.1_n8_tests_humaneval_qwen3-0.6b-unique_lr1e-5_t0.0_n1
• 164 • 5
test-gen/humaneval_qwen3-0.6b-unique_lr1e-5_t0.0_n1_generated_tests
• 164 • 4
test-gen/code_humaneval_qwen2.5-3b_t0.1_n8_tests_humaneval_qwen3-0.6b-easy_lr1e-5_t0.0_n1
• 164 • 6
test-gen/humaneval_qwen3-0.6b-easy_lr1e-5_t0.0_n1_generated_tests
• 164 • 5
test-gen/code_mbpp_qwen2.5-3b_t0.1_n8_tests_mbpp_qwen3-8b-easy-unique_lr1e-5_t0.0_n1
• 500 • 5
test-gen/mbpp_qwen3-8b-easy-unique_lr1e-5_t0.0_n1_generated_tests
• 500 • 4
test-gen/code_mbpp_qwen2.5-3b_t0.1_n8_tests_mbpp_qwen3-8b-unique_lr1e-5_t0.0_n1
• 500 • 6
test-gen/mbpp_qwen3-8b-unique_lr1e-5_t0.0_n1_generated_tests
• 500 • 5
test-gen/code_mbpp_qwen2.5-3b_t0.1_n8_tests_mbpp_qwen3-8b-easy_lr1e-5_t0.0_n1
• 500 • 6
test-gen/mbpp_qwen3-8b-easy_lr1e-5_t0.0_n1_generated_tests
• 500 • 4
test-gen/code_mbpp_qwen2.5-3b_t0.1_n8_tests_mbpp_qwen3-8b-random_lr1e-5_t0.0_n1
• 500 • 6
test-gen/mbpp_DeepSeek-R1-Distill-Qwen-14B_t0.6_n1_think_generated_tests
• 500 • 4
test-gen/mbpp_qwen3-8b-random_lr1e-5_t0.0_n1_generated_tests
• 500 • 5
test-gen/code_mbpp_qwen2.5-3b_t0.1_n8_tests_mbpp_qwen3-4b-easy-unique_lr1e-5_t0.0_n1
• 500 • 6
test-gen/mbpp_DeepSeek-R1-Distill-Qwen-7B_t0.6_n1_think_generated_tests
• 500 • 5
test-gen/mbpp_qwen3-4b-easy-unique_lr1e-5_t0.0_n1_generated_tests
• 500 • 5
test-gen/code_mbpp_qwen2.5-3b_t0.1_n8_tests_mbpp_qwen3-4b-unique_lr1e-5_t0.0_n1
• 500 • 6
test-gen/mbpp_qwen3-4b-unique_lr1e-5_t0.0_n1_generated_tests
• 500 • 4
test-gen/mbpp_DeepSeek-R1-Distill-Qwen-1.5B_t0.6_n1_think_generated_tests
• 500 • 5
test-gen/code_mbpp_qwen2.5-3b_t0.1_n8_tests_mbpp_qwen3-4b-easy_lr1e-5_t0.0_n1
• 500 • 6
test-gen/mbpp_qwen3-4b-easy_lr1e-5_t0.0_n1_generated_tests
• 500 • 5
test-gen/code_mbpp_qwen2.5-3b_t0.1_n8_tests_mbpp_qwen3-4b-random_lr1e-5_t0.0_n1
• 500 • 6
test-gen/mbpp_qwen3-4b-random_lr1e-5_t0.0_n1_generated_tests
• 500 • 4
test-gen/livecodebench_DeepSeek-R1-Distill-Qwen-32B_t0.6_n1_think_generated_tests
• 182 • 6
test-gen/code_mbpp_qwen2.5-3b_t0.1_n8_tests_mbpp_qwen3-1.7b-easy-unique_lr1e-5_t0.0_n1
• 500 • 6
test-gen/mbpp_qwen3-1.7b-easy-unique_lr1e-5_t0.0_n1_generated_tests
• 500 • 5
test-gen/code_mbpp_qwen2.5-3b_t0.1_n8_tests_mbpp_qwen3-1.7b-unique_lr1e-5_t0.0_n1
• 500 • 6