fix: restore frontend-backend compatibility with dual response keys cfb0d65 vivekvish2004 committed 4 days ago
fix: backend bugs — /state double call, /step stale check, missing openenv-core b5dacbd vivekvish2004 committed 4 days ago
feat: proper task design — realistic scenarios, clearer graders, auto-validate 08e86b6 vivekvish2004 committed 4 days ago
fix: add /health, /metadata, /schema, /mcp endpoints + per-task graders 62fbd09 vivekvish2004 committed 4 days ago
Final Compliance Fix: Add static tasks.json and explicit tasks reference in openenv.yaml for automated validator enumeration. 7a699bb vivekvish2004 committed 6 days ago
Compliance alignment: Use get_tasks() method and boolean grader flags for better OpenEnv discovery. 88cf143 vivekvish2004 committed 6 days ago
Final compliance fix: Align API with OpenEnv spec, refine logging, and update metadata. 204d70c vivekvish2004 committed 6 days ago
Refine task validation: Explicit grader refs and tasks property. 4e82d0a vivekvish2004 committed 6 days ago
Fix task validation: Ensure 7 tasks with graders and get_tasks() method. ce5f237 vivekvish2004 committed 6 days ago
fix: resolve grader 500 error by ensuring ground_truth is set before scoring 7cfbb7a vivekvish2004 committed 6 days ago
perf: optimize Docker build speed and backend startup synchronization af7f1e6 vivekvish2004 committed 7 days ago
fix: resolve "Not enough tasks with graders" validation error f5bdd31 vivekvish2004 committed 7 days ago
feat: implement step-level status tracking in environment and dashboard d5c3fab vivekvish2004 committed 7 days ago
Env: Expand sentiment list with Happy, Panicked, Concerned, and Curious 041bc5c vivekvish2004 committed 7 days ago
Standard API: Implement step/reset/state compliance in CustomerSupportEnv ff902bf vivekvish2004 committed 7 days ago
Dashboard: Add AI Suggestion feature and improve Enterprise UI bc78491 vivekvish2004 committed 7 days ago
Standard Template Restructuring: Passes openenv validate 137e754 vivekvish2004 committed 7 days ago