perf(ui): fewer Gradio steps + live STEPS_PER_EPISODE, tighter token cap 26e10b8 sanjay7676 committed on Apr 26
fix(api): Pydantic v2 step payload (exclude_none); env candidate_solutions; README Space API curl guide; Gradio API tab 2b35bd5 sanjay7676 committed on Apr 26
Fix DPO Dataset Generation: Real preference pairs now exported from adversarial loop 96f35ba sanjay7676 committed on Apr 25
Final Submission Upgrade: Advanced tier progression, professional README, and hackathon blog bb6d47c sanjay7676 committed on Apr 25
Final cleanup for FORGE-v4: Colab entrypoint, OpenEnv API, 10x optimization, and Judge Narrative generation 3978c05 sanjay7676 committed on Apr 25