Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
zhibinlan
/
UME-R1-2B
like
5
Image-Text-to-Text
Transformers
Safetensors
English
qwen2_vl
image-to-text
Sentence Similarity
Embedding
zero-shot-image-classification
video-text-to-text
conversational
text-generation-inference
arxiv:
2511.00405
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
UME-R1-2B
/
README.md
Commit History
Update README.md
e7aaa25
verified
zhibinlan
commited on
Nov 10, 2025
Update README.md
4f5dc8e
verified
zhibinlan
commited on
Nov 4, 2025
Update README.md
82b303b
verified
zhibinlan
commited on
Nov 4, 2025
update readme
e0110ca
zhibinlan
commited on
Oct 15, 2025
initial commit
0ee2570
verified
zhibinlan
commited on
Sep 29, 2025