Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

internlm
/
Spatial-SSRL-7B

Image-Text-to-Text
Transformers
Safetensors
English
qwen2_5_vl
image-to-text
multimodal
spatial
sptial understanding
self-supervised learning
conversational
text-generation-inference
Model card Files Files and versions
xet
Community
Spatial-SSRL-7B / assets
15.6 MB
  • 1 contributor
History: 4 commits
yuhangzang's picture
yuhangzang
Upload comparison_v2.png
cf00570 verified 18 days ago
  • case1.jpg
    336 kB
    xet
    Upload 2 files about 1 month ago
  • case2.jpg
    235 kB
    xet
    Upload 2 files about 1 month ago
  • comparison_v2.png
    3.37 MB
    xet
    Upload comparison_v2.png 18 days ago
  • exp_result.png
    216 kB
    xet
    Add files using upload-large-folder tool about 1 month ago
  • pipeline_1029final.png
    7.84 MB
    xet
    Add files using upload-large-folder tool about 1 month ago
  • teaser_1029final.png
    3.58 MB
    xet
    Add files using upload-large-folder tool about 1 month ago