Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
internlm
/
Spatial-SSRL-7B
like
10
Follow
Intern Large Models
853
Image-Text-to-Text
Transformers
Safetensors
internlm/Spatial-SSRL-81k
English
qwen2_5_vl
image-to-text
multimodal
spatial
sptial understanding
self-supervised learning
conversational
text-generation-inference
arxiv:
2510.27606
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Spatial-SSRL-7B
/
assets
15.6 MB
1 contributor
History:
4 commits
yuhangzang
Upload comparison_v2.png
cf00570
verified
18 days ago
case1.jpg
336 kB
xet
Upload 2 files
about 1 month ago
case2.jpg
235 kB
xet
Upload 2 files
about 1 month ago
comparison_v2.png
Safe
3.37 MB
xet
Upload comparison_v2.png
18 days ago
exp_result.png
216 kB
xet
Add files using upload-large-folder tool
about 1 month ago
pipeline_1029final.png
Safe
7.84 MB
xet
Add files using upload-large-folder tool
about 1 month ago
teaser_1029final.png
Safe
3.58 MB
xet
Add files using upload-large-folder tool
about 1 month ago