File size: 399 Bytes
f9db444 2d683fe f9db444 2d683fe f9db444 2d683fe f9db444 2d683fe |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 |
---
base_model:
- Qwen/Qwen2.5-VL-7B-Instruct
datasets:
- kolerk/TON-Math-SFT
language:
- en
license: apache-2.0
metrics:
- accuracy
pipeline_tag: image-text-to-text
library_name: transformers
---
This is the model cited in the paper: [Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models](https://arxiv.org/abs/2505.16854).
Code: https://github.com/kokolerk/TON |