WhyDidItFail / server /graders.py

Commit History

fix: clamp all rewards and scores to [0.10, 0.90]
d3b224f

samrat-rm Claude Sonnet 4.6 commited on

fix: enforce reward bounds (0.01–0.99) and 2 decimal precision across grader, env, and inference
3781ce7

samrat-rm commited on

fix: reward scores are updated to be between 0 and 1
c130122

samrat-rm commited on

chore: clean up all the unnecessary comments
afa4b9d

samrat-rm commited on

fix: normalize underfitting gradient norms and guard vague-answer penalty
909dfde

samrat-rm commited on

feat: updating the evidence scoring function
a91fb6a

samrat-rm commited on

feat: adding steps count logic to encourage the agent explore more
17a43d0

samrat-rm commited on

feat: max step limit
1288c52

samrat-rm commited on

feat: fix suggestion is required and not providing fix causes penalty
c6888af

samrat-rm commited on

feat: ordering_bonus function implementation
236cf5b

samrat-rm commited on

feat(grade): inspected is upgraded to inspected_order. It rewards steps taken in order
a818334

samrat-rm commited on

feat: grade function refactor and additional features
740ac53

samrat-rm commited on

feat: init graders and implement grade_easy() in env
243b472

samrat-rm commited on