DiagramIR: An Automatic Pipeline for Educational Math Diagram Evaluation Paper • 2511.08283 • Published about 1 month ago
A Matter of Interest: Understanding Interestingness of Math Problems in Humans and Language Models Paper • 2511.08548 • Published about 1 month ago
From Next-Token to Mathematics: The Learning Dynamics of Mathematical Reasoning in Language Models Paper • 2407.00900 • Published Jul 1, 2024
CurLL: A Developmental Framework to Evaluate Continual Learning in Language Models Paper • 2510.13008 • Published Oct 14