fix: clamp all grader scores to strict (0,1) open interval 71ea0d8 padmapriyagosakan commited on Apr 12
Fix: add grader property to PayOpsTask, include grader in grade_episode per_task_rewards, add per-task grader definitions to openenv.yaml 667fa47 padmapriyagosakan commited on Apr 9
fix: align step_reward with grade_episode, pin deps, update docs, clean inference 3f78483 padmapriyagosakan commited on Mar 31
feat: investigation required for full credit on hard/critical + replay seed support fb34eca padmapriyagosakan commited on Mar 31
feat: enforce investigation discipline + fix easy-task grading + add investigation_hints 622e841 padmapriyagosakan commited on Mar 30
feat: Iteration_2 — trajectory reward shaping, episode jitter, flag identification 9c003f0 padmapriyagosakan commited on Mar 27