Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
l3nux
/
nanoVLM_vqa
like
1
Image-Text-to-Text
Safetensors
nanovlm
vision-language
multimodal
research
License:
mit
Model card
Files
Files and versions
xet
Community
main
nanoVLM_vqa
912 MB
1 contributor
History:
2 commits
l3nux
Upload nanoVLM using push_to_hub
9b00b0a
verified
2 months ago
.gitattributes
1.52 kB
initial commit
2 months ago
README.md
1.09 kB
Upload nanoVLM using push_to_hub
2 months ago
config.json
3.54 kB
Upload nanoVLM using push_to_hub
2 months ago
model.safetensors
912 MB
xet
Upload nanoVLM using push_to_hub
2 months ago