CohenQu/POPE-hard-dataset-Qwen3-4B-Instruct-32k-128-filtered-iter2 Viewer • Updated Oct 7, 2025 • 1.12k • 4
CohenQu/POPE-hard-dataset-Qwen3-4B-Instruct-32k-128-filtered-gemini-success Viewer • Updated Oct 7, 2025 • 216 • 1
CohenQu/POPE-hard-dataset-Qwen3-4B-Instruct-32k-128-filtered Viewer • Updated Oct 7, 2025 • 1.33k • 9
CohenQu/POPE-source-dataset-Qwen3-4B-Instruct-eval-32k-16_all Viewer • Updated Oct 6, 2025 • 2.52k • 10
CohenQu/Continue_vs_Terminate.06.eval_prediction.09.23.step1 Viewer • Updated Sep 23, 2025 • 5.56k • 1
CohenQu/Continue_vs_Terminate.06.eval_prediction.09.22.step3 Viewer • Updated Sep 23, 2025 • 5.06k • 1
CohenQu/Continue_vs_Terminate.06.eval_prediction.09.22.step2 Viewer • Updated Sep 23, 2025 • 67.5k • 1
CohenQu/Continue_vs_Terminate.06.eval_prediction.09.23.step3 Viewer • Updated Sep 23, 2025 • 5.2k • 10
CohenQu/Continue_vs_Terminate.06.eval_prediction.09.22.step1 Viewer • Updated Sep 22, 2025 • 1.46k • 2