GrassData/cliptagger-12b
Image-Text-to-Text
English
VLM
video-understanding
image-captioning
gemma
json-mode
structured-output
video-analysis
Eval Results (legacy)
License: apache-2.0
cliptagger-12b/assets/cost.png (branch: main)
Aidan Erickson: Upload 7 files (commit 3735407, verified, 9 months ago)
45.3 kB