| license: mit | |
| datasets: | |
| - VLM-Reasoner/VerMulti | |
| language: | |
| - en | |
| base_model: | |
| - Qwen/Qwen2.5-VL-3B-Instruct | |
| pipeline_tag: visual-question-answering | |
| This repository contains the model presented in [LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL](https://huggingface.co/papers/2503.07536). | |
| Project page: https://forjadeforest.github.io/LMM-R1-ProjectPage |