| base_model: | |
| - Qwen/Qwen2.5-VL-3B-Instruct | |
| datasets: | |
| - kolerk/TON-Math-SFT | |
| language: | |
| - en | |
| license: apache-2.0 | |
| metrics: | |
| - accuracy | |
| pipeline_tag: image-text-to-text | |
| library_name: transformers | |
| This is the model cited in the paper: [Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models](https://arxiv.org/abs/2505.16854). | |
| Github: https://github.com/kokolerk/TON |