Use real LLM call for proxy check + baseline scores for task validation 5e3e79e junaid0600 commited on Apr 10
Clean inference.py using baseline scores strictly between 0 and 1 b02ec3c junaid0600 commited on Apr 10
Fix rewards never exactly 0.0 or 1.0 using proper normalization 7dff36b junaid0600 commited on Apr 10
Update inference.py with [START]/[STEP]/[END] format and dotenv loading 5447299 junaid0600 commited on Apr 5
Fix openenv validate: add uv.lock, openenv-core, server entry point df7bda2 junaid0600 commited on Mar 29