nvidia/omnivinci
Tags: Feature Extraction, Transformers, Safetensors, vila, omni-modal, multimodal, vision, audio, video, llm, custom_code, Eval Results (legacy)
arxiv: 2510.15870
License: apache-2.0
Branch: main
Path: omnivinci/mm_projector
Total size: 125 MB
2 contributors
History: 1 commit
Latest: Hanrong Ye, commit c48c32c, 25 days ago
config.json        243 Bytes  (Safe)  commit  25 days ago
model.safetensors  125 MB     (xet)   commit  25 days ago