On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification • Paper • 2508.05629 • Published Aug 7, 2025