feat: environment redesign — real CSV data, shaped rewards, difficulty tiers 329e3d3 aatmk-panse committed on about 1 month ago
fix: actionable hints now include real IDs (location_ids, product_ids, rules) 2bcac83 aatmk-panse committed on about 1 month ago
fix: fallback to max_completion_tokens for newer models ca40c95 aatmk-panse committed on about 1 month ago
fix: coerce string prices to float in update_variant f824b2c aatmk-panse committed on about 1 month ago
fix: inference.py runs all 3 tasks with proper [START]/[STEP]/[END] per task dce8bf6 aatmk-panse committed on about 1 month ago
feat: add /tasks and /grade endpoints for competition validator c64fcea aatmk-panse committed on about 1 month ago
fix: move all files to repo root (HF Spaces requires README.md at root) 699d744 aatmk-panse committed on about 1 month ago
Fix inference.py stdout format to match sample spec exactly da3eda7 aatmk-panse committed on about 1 month ago
Add 3 tasks with graders and clamp scores to (0, 1) c5968b0 aatmk-panse committed on about 1 month ago
Align env var names with OpenEnv sample inference spec 50cd687 aatmk-panse committed on about 1 month ago
Initial commit: Shopify store audit OpenEnv environment 362bbff aatmk-panse committed on about 1 month ago