MBZUAI/CoME-VL
Mohamed Bin Zayed University of Artificial Intelligence
Image-Text-to-Text
Transformers
English
multimodal
charts
diagrams
pointing
localization
CoME-VL
arXiv: 2604.03231
License: apache-2.0
refs/pr/1
Commit History
b61b123 (verified) Update README.md, committed by ItsMaxNorm 2 days ago
b1c06e1 (verified) Update README.md, committed by ankanmbz 5 days ago
d189598 (verified) Update README.md, committed by ankanmbz 5 days ago
cbe6b2b (verified) Update README.md, committed by ankanmbz 5 days ago
5be1187 (verified) Added the assets, committed by ankanmbz 5 days ago
dfd54bf (verified) Delete .DS_Store, committed by ankanmbz 6 days ago
0b96c50 (verified) Upload CoME-VL checkpoint, committed by ankanmbz 6 days ago
da124db (verified) initial commit, committed by ankanmbz 6 days ago