Classroom Final Exam: An Instructor-Tested Reasoning Benchmark Paper • 2602.19517 • Published 9 days ago • 3
Article Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model Jan 1 • 18
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL Paper • 2503.07536 • Published Mar 10, 2025 • 88