Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
zhaode
/
FastVLM-7B-Stage3
like
0
Image-Text-to-Text
Transformers
Safetensors
English
llava_qwen2
text-generation
multimodal
conversational
License:
apple
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
FastVLM-7B-Stage3
/
llava
/
model
/
multimodal_encoder
147 kB
1 contributor
History:
1 commit
zhaode
Upload folder using huggingface_hub
fd52dc4
verified
7 months ago
__pycache__
Upload folder using huggingface_hub
7 months ago
mobileclip
Upload folder using huggingface_hub
7 months ago
builder.py
934 Bytes
Upload folder using huggingface_hub
7 months ago
clip_encoder.py
Safe
6.7 kB
Upload folder using huggingface_hub
7 months ago
mobileclip_encoder.py
4.35 kB
Upload folder using huggingface_hub
7 months ago