Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
OX-PIXL
/
SpatialThinker-7B
like
1
Follow
Perceptual Intelligence and Extended Reality Lab
6
Image-Text-to-Text
Safetensors
English
qwen2_5_vl
spatial-reasoning
multimodal
vision-language
scene-graph
reinforcement-learning
conversational
arxiv:
2511.07403
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
4
Update README with model/dataset documentation
#2
by
hunarbatra
- opened
2 days ago
base:
refs/heads/main
←
from:
refs/pr/2
Discussion
Files changed
+161
-200
hunarbatra
Perceptual Intelligence and Extended Reality Lab org
2 days ago
No description provided.
Update README with model/dataset documentation
059d66ae
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Cannot merge
This branch has merge conflicts in the following files:
README.md
Comment
·
Sign up
or
log in
to comment