Add line numbers to code display, fix clear_flag loop, match submission spec exactly 6535d0b codemaverick2 commited on 2 days ago
Fix stdout to [START]/[STEP]/[END] format, use OpenAI client with HF defaults d3e536b codemaverick2 commited on 2 days ago
Add diversity/exploration bonuses, near-miss type check, context truncation 78f3eb2 codemaverick2 commited on 2 days ago
Add 7-task RL env with PBRS, CAMRL curriculum, VL norm, RC-GRPO inference e48a1e4 codemaverick2 commited on 2 days ago