feat: normalize rewards to 0-1 scale for consistency with grader spec 731a8d0 Sushruth21 committed 5 days ago