Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
MBZUAI
/
CoME-VL
like
3
Follow
Mohamed Bin Zayed University of Artificial Intelligence
754
Image-Text-to-Text
Transformers
English
multimodal
charts
diagrams
pointing
localization
CoME-VL
arxiv:
2604.03231
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
CoME-VL
/
.gitattributes
Commit History
Added the assets
5be1187
verified
ankanmbz
commited on
6 days ago
initial commit
da124db
verified
ankanmbz
commited on
7 days ago