Miaow-Lab/RLVR-Linearity-Dataset
Viewer
•
Updated
•
40.3k
•
26
RL training and evaluation datasets, and checkpoints in 'Not All Steps are Informative: On the Linearity of LLMs’ RLVR Training'