FEST - a kaiyan289 Collection

kaiyan289's Collections

updated May 7

Checkpoints for the paper "Boosting Reinforcement Learning with Verifiable Rewards via Randomly Selected Few-Shot Guidance"