preference-lab / server /environment.py

Commit History

refactor: apply production readiness recommendations including dataset caching, XSS protection, pure schemas, and JSON decoding logic.
5ee1380
Running

Sibam commited on

fix: clamp grader rewards to strictly (0, 1) to pass OpenEnv validation bounds
f3f7bc4

Sibam commited on

final: submission ready
a4c268d

Sibam commited on

fix: conform to OpenEnv base interface contract
7574c9a

Sibam commited on

Enhancement
dada51b

Sibam commited on

PreferenceLab OpenEnv environment for RLHF preference simulation
cdf485e

Sibam commited on