penfever/meta-llama_Llama-3.1-70B-Instruct-jdgfct-Factuality Viewer • Updated Apr 15 • 1M • 210
penfever/meta-llama_Llama-3.1-70B-Instruct-jdgfct-Completeness Viewer • Updated Apr 14 • 985k • 11
penfever/meta-llama_Llama-3.1-8B-Instruct-jdgfct-Harmlessness Viewer • Updated Apr 13 • 984k • 5
penfever/meta-llama_Llama-3.1-8B-Instruct-jdgfct-Completeness Viewer • Updated Apr 13 • 984k • 6
penfever/meta-llama_Llama-3.1-8B-Instruct-jdgfct-Conciseness Viewer • Updated Apr 13 • 984k • 219
penfever/meta-llama_Llama-3.1-8B-Instruct-jdgfct-Readability Viewer • Updated Apr 13 • 984k • 5
penfever/rl__64GPU_base_32b__nl2bash-tasks-cleaned-oracle__syh-r2eg-askl-glm_4__40-0 Updated Apr 4 • 3
penfever/rl__24GPU_shaped__stackexchange-overflow-sandboxes-skywork-response__exp_tas_optimal_comb__40-0 Viewer • Updated Apr 1 • 41.8k • 5
penfever/rl__24GPU_shaped__inferredbugs-sandboxes-verifier__exp_tas_optimal_comb__40-0 Viewer • Updated Mar 26 • 30.8k • 5
penfever/rl__64GPU_shaped_32b_entropy__swe_rebench_patched_oracle__syh-r2eg-askl-glm_4__40-0 Viewer • Updated Mar 26 • 8.51k • 57
penfever/rl__24GPU_shaped__nemotron-math-oracle-filtered__exp_tas_optimal_comb__40-0 Viewer • Updated Mar 26 • 22.4k • 5
penfever/Kimi-2.5-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-32k-reward1 Viewer • Updated Mar 26 • 5.24k • 12
penfever/Kimi-2.5-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-32k Viewer • Updated Mar 26 • 9.36k • 35
penfever/rl__24GPU_shaped_entropy__swe_rebench_patched_oracle__100k_wd0__Qwen3-8B__20-0 Viewer • Updated Mar 26 • 9.97k • 80
penfever/rl__24GPU_shaped__selfinstruct-naive-sandboxes-2-verified__exp_tas_optimal_comb__40-0 Viewer • Updated Mar 26 • 30.2k • 5
penfever/rl__24GPU_shaped_entropy__nemotron-math-oracle-filtered__100k_wd0 Viewer • Updated Mar 25 • 6.16k • 3
penfever/stackexchange-tezos-sandboxes__Kimi-2.5-smaxeps-32k Viewer • Updated Mar 23 • 8.62k • 26
penfever/rl__24GPU_shaped__exp_rpt_pymethods2test-large__GLM-4_7-swesmith-san Viewer • Updated Mar 23 • 21.8k • 6