Update README.md
Browse files
README.md
CHANGED
|
@@ -10,5 +10,10 @@ base_model:
|
|
| 10 |
- Qwen/Qwen2.5-VL-7B-Instruct
|
| 11 |
pipeline_tag: question-answering
|
| 12 |
---
|
|
|
|
| 13 |
|
| 14 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 10 |
- Qwen/Qwen2.5-VL-7B-Instruct
|
| 11 |
pipeline_tag: question-answering
|
| 12 |
---
|
| 13 |
+
The model is trained with curiosity-driven RL described in [paper](https://arxiv.org/abs/2505.15966).
|
| 14 |
|
| 15 |
+
We have released vllm based inference code at https://github.com/TIGER-AI-Lab/Pixel-Reasoner/.
|
| 16 |
+
|
| 17 |
+
We will release a simple hf.generate() based inference code.
|
| 18 |
+
|
| 19 |
+
Please also play with the cool [interactive demo](https://huggingface.co/spaces/TIGER-Lab/Pixel-Reasoner)
|