Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
kendallpark
/
maryVLM_bs256_gray_ms0_ds0
like
0
Image-Text-to-Text
nanovlm
vision-language
multimodal
research
conversational
License:
mit
Model card
Files
Files and versions
xet
Community
Training Metadata
nanoVLM
is a minimal and lightweight Vision-Language Model (VLM).
Training Metadata
Checkpoint
: 22000
Downloads last month
-
Inference Providers
NEW
Image-Text-to-Text
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support