Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
a8cheng 's Collections
3D Aware Region Prompted Vision Language Model
NaVILA: Legged Robot Vision-Language-Action Model for Naviga
SpatialRGPT: Grounded Spatial Reasoning in VLMs

SpatialRGPT: Grounded Spatial Reasoning in VLMs

updated Oct 11, 2024
Upvote
5

  • a8cheng/SpatialRGPT-VILA1.5-8B

    Updated Oct 6, 2024 • 8 • 7

  • a8cheng/OpenSpatialDataset

    Updated Oct 3, 2024 • 91 • 13

  • a8cheng/SpatialRGPT-Bench

    Viewer • Updated May 5, 2025 • 1.41k • 566 • 12
Upvote
5
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs