Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning Paper • 2602.01058 • Published Feb 1 • 41
MedFrameQA: A Multi-Image Medical VQA Benchmark for Clinical Reasoning Paper • 2505.16964 • Published May 22, 2025 • 1