Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
nvidia
/
VILA-HD-8B-PS3-4K-SigLIP
like
2
Follow
NVIDIA
58.2k
Image-Text-to-Text
Safetensors
English
llava_topdown_llama
VLM
VILA-HD
PS3
arxiv:
2503.19903
arxiv:
2412.04468
License:
cc-by-nc-sa-4.0
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
VILA-HD-8B-PS3-4K-SigLIP
17.6 GB
Ctrl+K
Ctrl+K
2 contributors
History:
5 commits
bfshi-nvidia
Upload README.md with huggingface_hub
7575a20
verified
10 months ago
assets
Upload folder using huggingface_hub
10 months ago
llm
Upload folder using huggingface_hub
about 1 year ago
mm_projector
Upload folder using huggingface_hub
about 1 year ago
vision_tower
Upload folder using huggingface_hub
12 months ago
.gitattributes
1.77 kB
Upload folder using huggingface_hub
10 months ago
README.md
8.96 kB
Upload README.md with huggingface_hub
10 months ago
config.json
7.78 kB
Upload folder using huggingface_hub
about 1 year ago
trainer_state.json
575 kB
Upload folder using huggingface_hub
about 1 year ago