Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
wingrune
/
3DGraphLLM
like
0
Image-Text-to-Text
Transformers
3d-scene-understanding
scene-graph
multimodal
vlm
llama
vision-language-model
arxiv:
2412.18450
License:
mit
Model card
Files
Files and versions
xet
Community
2
Deploy
Use this model
d3fc72d
3DGraphLLM
80.2 kB
3 contributors
History:
4 commits
wingrune
Update README.md
d3fc72d
verified
about 1 year ago
.gitattributes
Safe
1.52 kB
initial commit
about 1 year ago
README.md
Safe
1.09 kB
Update README.md
about 1 year ago
ga.png
Safe
77.5 kB
Upload ga.png
about 1 year ago