Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
OX-PIXL
/
SpatialThinker-7B
like
1
Follow
Perceptual Intelligence and Extended Reality Lab
6
Image-Text-to-Text
Safetensors
English
qwen2_5_vl
spatial-reasoning
multimodal
vision-language
scene-graph
reinforcement-learning
conversational
arxiv:
2511.07403
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
4
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (2)
Sort: Recently created
Update README with model/dataset documentation
#2 opened 1 day ago by
hunarbatra
Enhance model card: Metadata, links, and usage example
#1 opened 2 months ago by
nielsr