penfever/a3-rl-DCAgent_selfinstruct-naive-sandboxes-2-verified Viewer • Updated 26 days ago • 44.4k • 48
penfever/nl2bash-tasks-cleaned-oracle-minimax-m27-131k-traces Viewer • Updated 27 days ago • 1.57k • 39
penfever/a3-rl-laion_nemotron-gym-knowledge-web-search-mcqa Viewer • Updated 27 days ago • 39.9k • 53
penfever/exp_rpt_methods2test-large-v3-minimax-m27-131k-traces Viewer • Updated 27 days ago • 4.45k • 36
penfever/a3-rl-laion_nemotron-gym-math-advanced-calculations-v3 Viewer • Updated 27 days ago • 41.9k • 33
penfever/a3-rl-laion_nemotron-gym-instruction-following-structured Viewer • Updated 28 days ago • 56k • 30
penfever/nemotron-code-oracle-filtered-minimax-m27-131k-traces Viewer • Updated 29 days ago • 10.9k • 44