Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
pgryko
/
nanovlm-COCO-VQAv2
like
2
Image-to-Text
Safetensors
PyTorch
HuggingFaceM4/COCO
HuggingFaceM4/VQAv2
English
vision-language-model
multimodal
nanovlm
modal-trained
image-captioning
visual-question-answering
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
nanovlm-COCO-VQAv2
1.8 GB
Ctrl+K
Ctrl+K
1 contributor
History:
3 commits
pgryko
Upload README.md with huggingface_hub
3f2ff5e
verified
10 months ago
.gitattributes
Safe
1.52 kB
initial commit
10 months ago
README.md
4.61 kB
Upload README.md with huggingface_hub
10 months ago
config.json
Safe
1.34 kB
Upload nanoVLM using push_to_hub
10 months ago
model.safetensors
1.8 GB
xet
Upload nanoVLM using push_to_hub
10 months ago