nuprl-staging/bug_injection_completions_4493644 Viewer • Updated about 21 hours ago • 1.47k • 12
PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published Feb 3, 2025 • 9