TIGER-Lab
/

PixelReasoner-RL-v1

Image-Text-to-Text

text-generation-inference

Model card Files Files and versions

JasperHaozhe commited on May 24, 2025

Commit

477cc28

·

verified ·

1 Parent(s): ea66ff2

Update README.md

Files changed (1) hide show

README.md +6 -1

README.md CHANGED Viewed

@@ -10,5 +10,10 @@ base_model:
 - Qwen/Qwen2.5-VL-7B-Instruct
 pipeline_tag: question-answering
 ---
-This is the model trained with https://github.com/TIGER-AI-Lab/Pixel-Reasoner/.

 - Qwen/Qwen2.5-VL-7B-Instruct
 pipeline_tag: question-answering
 ---
+The model is trained with curiosity-driven RL described in [paper](https://arxiv.org/abs/2505.15966).
+We have released vllm based inference code at https://github.com/TIGER-AI-Lab/Pixel-Reasoner/.
+We will release a simple hf.generate() based inference code.
+Please also play with the cool [interactive demo](https://huggingface.co/spaces/TIGER-Lab/Pixel-Reasoner)