Serve React frontend from FastAPI backend for one-click access 62c7e95 havinashpatil commited on 17 days ago
Finalizing CodeArena RL Benchmark: frontend improvements, GRPO training scripts, and cleaned environment 03a7eb9 havinashpatil commited on 17 days ago
fix: clamp reward to [0.01,0.99] so .2f never rounds to 0.00 or 1.00 59fd9d3 havinashpatil commited on 17 days ago
fix: removed invalid openenv-py package from notebook install cell 82e39c9 havinashpatil commited on 17 days ago
feat: use m-a-p/Code-Feedback dataset for GRPO training 9204c04 havinashpatil commited on 17 days ago
chore: update dependencies and include training results for README 8599a81 havinashpatil commited on 17 days ago
Complete all tasks: Adaptive curriculum, GRPO, React frontend, LLM-as-a-judge a448db8 havinashpatil commited on 18 days ago
fix: reset task_id parsing, grader tuple crash fallback, and inference score output 646409d adityanaikhpt commited on Apr 8
Minimal patch: standalone proxy ping + reward clamped to (0,1) 74bfde0 adityanaikhpt commited on Apr 8
fix: use API_BASE_URL/API_KEY for LiteLLM proxy β always make API call (Phase 2) 51fdbe8 adityanaikhpt commited on Apr 8
fix: make inference.py crash-proof when OPENAI_API_KEY is missing (Phase 2) 1fe26af adityanaikhpt commited on Apr 8
fix: OpenEnv multi-mode compliance β add main() entrypoint and uv.lock e92bfc1 adityanaikhpt commited on Apr 8
Production-ready: add server/app.py with fallback-safe /reset, fix Dockerfile, add HF metadata, add task JSON files dcc8fa3 adityanaikhpt commited on Apr 8