rhinocamp/nanoVLM
Tags: Image-Text-to-Text, Safetensors, nanovlm, vision-language, multimodal, research
License: mit
Branch: main
Repository size: 888 MB
1 contributor
History: 6 commits
Latest commit: rhinocamp, "Upload nanoVLM using push_to_hub" (555a9bc, verified), 7 months ago
File                 Size           Last commit                        Updated
.gitattributes       1.52 kB        initial commit                     7 months ago
README.md            1.09 kB        Upload nanoVLM using push_to_hub   7 months ago
config.json          1.01 kB        Upload nanoVLM using push_to_hub   7 months ago
model.safetensors    888 MB (xet)   Upload nanoVLM using push_to_hub   7 months ago