fix: explicit step() payload, urllib.error import, no **action spread e2e5e7a verified prasanthdj8 commited on Apr 10
fix: match sample format exactly — :.2f rewards, :.3f score, finally block 856fd56 verified prasanthdj8 commited on Apr 10
fix: use 0.001/0.999 bounds matching openenv.yaml score_range — app.py 8a3d177 verified prasanthdj8 commited on Apr 10
fix: use 0.001/0.999 bounds matching openenv.yaml score_range — graders.py f720bf0 verified prasanthdj8 commited on Apr 10
fix: use 0.001/0.999 bounds matching openenv.yaml score_range — inference.py 2023445 verified prasanthdj8 commited on Apr 10
fix: use 0.001/0.999 bounds matching openenv.yaml score_range — env.py b70c5c6 verified prasanthdj8 commited on Apr 10
fix: add WebSocket /ws endpoint and fix reset response structure 44fc075 verified prasanthdj8 commited on Apr 9
fix: prevent randint crash when expiry bounds are equal or inverted abb1c1a verified prasanthdj8 commited on Apr 8
fix: correct env name to retail-inventory-expiry in START log line 38983b6 verified prasanthdj8 commited on Apr 8
fix: rewards list uses final episode_score for validator consistency c339047 verified prasanthdj8 commited on Apr 8
fix: use 3dp formatting so clamped rewards never print as 0.00 or 1.00 3cdc239 verified prasanthdj8 commited on Apr 8
fix: rewrite stdout format to match required START/STEP/END spec 86140d7 verified prasanthdj8 commited on Apr 8
fix: add reward field to Observation per OpenEnv standard d1984a0 verified prasanthdj8 commited on Apr 8
fix: add reward field to Observation per OpenEnv standard 849802e verified prasanthdj8 commited on Apr 8
fix: add /metadata /schema /mcp endpoints and fix /health status 9060f18 verified prasanthdj8 commited on Apr 7