TON-3B-AITZ / README.md
nielsr's picture
nielsr HF Staff
Add library name, link to code
3b1aa1d verified
|
raw
history blame
392 Bytes
metadata
base_model:
  - Qwen/Qwen2.5-VL-3B-Instruct
datasets:
  - kolerk/TON-AITZ-SFT
language:
  - en
license: apache-2.0
pipeline_tag: image-text-to-text
library_name: transformers

This is the model cited in the paper: Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models.

Github repository: https://github.com/kokolerk/TON