v2.0 frontend: multi-step investigation UI with step tracker, progressive reveal, and reasoning bonus 8483903 Siteshcodes commited on Apr 12
v2.0: multi-step episodes, procedural bugs, semantic grading, sessions, 71 tests 703aa57 Siteshcodes commited on Apr 12
feat: serve frontend at root / so it shows in HF Spaces App tab, JSON status moved to /health ca5a648 Siteshcodes commited on Apr 10
feat: add interactive demo frontend at /web — no existing endpoints changed 787a5a5 Siteshcodes commited on Apr 10
fix: add name/difficulty to tasks, per-task [START]/[END] logs for validator 7eb0325 Siteshcodes commited on Apr 10
fix: replace 0.0 fallback with 0.05 in graders to satisfy strict range bc79ac5 Siteshcodes commited on Apr 9
fix: add openai to requirements - was causing silent import failure 86c4bbe Siteshcodes commited on Apr 8
fix: add get_metadata with tasks and graders to Environment class 46680d3 Siteshcodes commited on Apr 8
upgrade environment.py: done guard, fix tasks_completed, sample_bug ca4e18e Siteshcodes commited on Apr 2
upgrade task.py: milestone grading, team in medium, 5 bugs per task 442df7c Siteshcodes commited on Apr 2