Commit History

v2.0: multi-step episodes, procedural bugs, semantic grading, sessions, 71 tests
703aa57

Siteshcodes commited on

feat: serve frontend at root / so it shows in HF Spaces App tab, JSON status moved to /health
ca5a648

Siteshcodes commited on

feat: add interactive demo frontend at /web — no existing endpoints changed
787a5a5

Siteshcodes commited on

fix: add name/difficulty to tasks, per-task [START]/[END] logs for validator
7eb0325

Siteshcodes commited on

fix: stateful endpoints + score clamping for validator pass
6174aa3

Siteshcodes commited on

fix: no exact 0.0 or 1.0 anywhere in rewards
2fbe4d0

Siteshcodes commited on

fix: reward_range 0.05-0.95 and proper descriptions
926a06f

Siteshcodes commited on

fix: correct grader import paths
a1396d9

Siteshcodes commited on

fix: tasks returns plain array for validator
89bfee5

Siteshcodes commited on

fix: override /reset to accept task_id for validator
db98438

Siteshcodes commited on

fix: force override /metadata with tasks for validator
74c6ebb

Siteshcodes commited on

fix: override /metadata with tasks and graders for validator
b781553

Siteshcodes commited on

fix: add /grader and /baseline endpoints for validator
319cfcd

Siteshcodes commited on

fix: add task endpoints and per-task reset for validator
44b9283

Siteshcodes commited on

fix server/app.py
246501f

Siteshcodes commited on

fix: add root health check route
c700066

Siteshcodes commited on

fix pyproject.toml and generate uv.lock
e15d96a

Siteshcodes commited on

fix import paths for Docker
666b49a

Siteshcodes commited on

complete bug triage openenv environment
38ab410

Siteshcodes commited on