LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL
Paper
•
2503.07536
•
Published
•
88
A human-aligned video caption scorer model proposed in Cockatiel and trained on Cockatiel-4K.
For more details, please refer to our project page: https://sais-fuxi.github.io/projects/cockatiel