fix: raw ASGI middleware for score clamping - removes broken monkey-patch and BaseHTTPMiddleware 5f66d3a Pranesh commited on Apr 12
fix: stop fighting framework exclude, add ASGI score-clamp middleware 7209ed9 Pranesh Claude Opus 4.6 commited on Apr 12
fix: harden wire-level score clamping for Phase 2 validation 4d361f2 Pranesh Claude Opus 4.6 commited on Apr 12
fix: re-inject reward/done into obs dict so evaluator sees clamped scores bbb13fa Pranesh commited on Apr 12
fix: enforce strict (0,1) score range at all layers to pass Phase 2 validation a5f25ae Pranesh commited on Apr 12
fix: re-checkout app.py with proper UTF-8 encoding (remove null bytes) c8f4b71 Pranesh commited on Apr 12
fix: revert to passing server architecture - remove all middleware and model_serializer 2b185d5 Pranesh commited on Apr 12
fix: align STEP/END output format with hackathon spec (2dp rewards, rich action strings) d02f8d2 Pranesh commited on Apr 12
fix: stop data corruption in nuclear clamper and fix .2f rounding to 0.00/1.00 75ec67e Pranesh commited on Apr 12
feat: enforce strict reward bounds (0.0001-0.9999) and add ASGI middleware safety net be4b75a Pranesh commited on Apr 12
fix: handle None reward from openenv create_app - replace with cumulative_reward 8c8eb97 Pranesh commited on Apr 12
fix: nuclear score clamping - clamp ALL reward fields (reward, cumulative_reward, reward_delta) at 4 layers 01a2325 Pranesh commited on Apr 12
fix: correct test expectations and verification for nuclear reward clamping 6a462bd Pranesh commited on Apr 12
fix: nuclear fix β clamp ALL reward values to strict (0, 1) in ALL observations 5924dfb Pranesh commited on Apr 12
fix: remove generate_datasets.py from .dockerignore + regenerate uv.lock with server extras c52be7f Pranesh commited on Apr 11
fix: add HTTP POST /reset, /step endpoints with openenv protocol wrapping - Fallback returns {observation: {...}} matching openenv format - Step accepts {action: {...}} (openenv protocol) and flat fields - Dockerfile copies client.py, inference.py, REWARD_DESIGN.md - Dockerfile --extra server ensures fastapi/uvicorn install - uv.lock synced with pyproject.toml 8e15666 Pranesh commited on Apr 11
ULTRAPLAN: compliance fixes, grader diagnostics, stochastic mode, new corruptions e1f5cd8 Pranesh Claude Opus 4.6 commited on Apr 11
fix: remove invalid working-directory from CI workflow 5c30f45 Pranesh Claude Opus 4.6 commited on Apr 10
Elevate data quality env to research-grade: 6-phase enhancement f8049d5 Pranesh Claude Opus 4.6 commited on Apr 10
fix: add defense-in-depth score clamping and push to HF Space 216ccf9 Pranesh Claude Opus 4.6 commited on Apr 9
fix: clamp terminal observation scores to (0, 1) exclusive for validator compliance f194691 Pranesh commited on Apr 8
fix: clamp all score paths in inference.py to (0, 1) exclusive af7d5e5 Pranesh Claude Opus 4.6 commited on Apr 8
fix: clamp task scores to (0, 1) exclusive β validator rejects 0.0 and 1.0 a5544fc Pranesh Claude Opus 4.6 commited on Apr 8
fix: handle LiteLLM 400 errors and reduce context for small models dc7bac4 Pranesh Claude Opus 4.6 commited on Apr 8
fix: prevent eval timeout β reduce timeouts, retries, and add deadline 7017097 Pranesh Claude Opus 4.6 commited on Apr 8
fix: add root Dockerfile for HuggingFace Spaces deployment 3d27060 Pranesh Claude Opus 4.6 commited on Apr 8
ci: add GitHub Action to auto-sync to HuggingFace Space b163c73 Pranesh Claude Opus 4.6 commited on Apr 8
fix: resolve inference.py crash from context overflow and 400 fast-fail c8f4dc4 Pranesh Claude Opus 4.6 commited on Apr 8
fix: prevent non-zero exit on LiteLLM proxy 400 errors b84c108 Pranesh Claude Opus 4.6 commited on Apr 8
feat: harden RL signal and grader quality for hackathon submission 3db9e5d Pranesh Claude Opus 4.6 commited on Apr 8
fix: add explicit HF_TOKEN and LOCAL_IMAGE_NAME env var declarations 513a5c3 Pranesh Claude Opus 4.6 commited on Apr 8
feat: elevate environment to hackathon-winning quality b3e45cf Pranesh Claude Opus 4.6 commited on Apr 8
feat: Data Quality RL Environment β production-ready submission fc4c35a Praneshrajan137 commited on Apr 6