BaseReward: A Strong Baseline for Multimodal Reward Model Paper • 2509.16127 • Published Sep 19, 2025 • 21
MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs Paper • 2602.12705 • Published 28 days ago • 65
The Smol Training Playbook: The secrets to building world-class LLMs Space • Featured • 3.04k