RLVR Linearity - a Miaow-Lab Collection

updated May 22

RL training and evaluation datasets and model checkpoints from 'Linear Dynamics in the RLVR Training of Large Language Models'

Not All Steps are Informative: On the Linearity of LLMs' RLVR Training
Paper • 2601.04537 • Published Jan 8

Miaow-Lab/RLVR-Linearity-Dataset
Viewer • Updated May 22 • 40.3k • 50

Miaow-Lab/RLVR-Linearity-Checkpoints
Text Generation • Updated May 22