Reward Models Inherit Value Biases from Pretraining (ICLR 2026)
Collection: reward models for the paper Christian et al., "Reward Models Inherit Value Biases from Pretraining" (ICLR 2026) • 23 items • Updated 3 days ago