Add comprehensive LLM finetuning analysis with 7 visualization graphs 9d429ce havinashpatil commited on 11 days ago
Final hackathon submission: polished README + detailed blog writeup 90be6c7 havinashpatil commited on 11 days ago
Fix builtin syntax fixer for inline one-line headers and validate TGI fixes 0c0a8ff havinashpatil commited on 11 days ago
Fix HF Spaces TGI launch entrypoint to avoid unexpected argument error 27a8fcf havinashpatil commited on 11 days ago
Fix TGI endpoint plumbing and use full TGI runtime image in Dockerfile a8bc575 havinashpatil commited on 11 days ago
Add AI coding system with local Hugging Face LLM integration 271cc02 havinashpatil commited on 11 days ago
Update Dockerfile to build frontend for Spaces deployment da3a03d havinashpatil commited on 11 days ago
Serve React frontend from FastAPI backend for one-click access 62c7e95 havinashpatil commited on 11 days ago
Finalizing CodeArena RL Benchmark: frontend improvements, GRPO training scripts, and cleaned environment 03a7eb9 havinashpatil commited on 11 days ago
fix: clamp reward to [0.01,0.99] so .2f never rounds to 0.00 or 1.00 59fd9d3 havinashpatil commited on 12 days ago
fix: removed invalid openenv-py package from notebook install cell 82e39c9 havinashpatil commited on 12 days ago
feat: use m-a-p/Code-Feedback dataset for GRPO training 9204c04 havinashpatil commited on 12 days ago
chore: update dependencies and include training results for README 8599a81 havinashpatil commited on 12 days ago
Complete all tasks: Adaptive curriculum, GRPO, React frontend, LLM-as-a-judge a448db8 havinashpatil commited on 12 days ago
fix: reset task_id parsing, grader tuple crash fallback, and inference score output 646409d adityanaikhpt commited on 28 days ago
fix: exclusive score clamping (0.001, 0.999) for evaluator 2d8d38c adityanaikhpt commited on 28 days ago
Rewrite inference.py for strict OpenEnv parsing + add httpx eb60bd2 adityanaikhpt commited on 28 days ago
fix: add safe_reward() clamp at every reward return point f14f8d9 adityanaikhpt commited on 28 days ago
Minimal patch: standalone proxy ping + reward clamped to (0,1) 74bfde0 adityanaikhpt commited on 28 days ago
Add normalize_reward(); enforce strict (0,1) on all paths b53855f adityanaikhpt commited on 28 days ago
fix: use API_BASE_URL/API_KEY for LiteLLM proxy β always make API call (Phase 2) 51fdbe8 adityanaikhpt commited on 29 days ago
fix: make inference.py crash-proof when OPENAI_API_KEY is missing (Phase 2) 1fe26af adityanaikhpt commited on 29 days ago
fix: OpenEnv multi-mode compliance β add main() entrypoint and uv.lock e92bfc1 adityanaikhpt commited on 29 days ago
Production-ready: add server/app.py with fallback-safe /reset, fix Dockerfile, add HF metadata, add task JSON files dcc8fa3 adityanaikhpt commited on 29 days ago