Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
internlm
/
CapRL-Qwen3VL-2B
like
9
Follow
Intern Large Models
936
Image-Text-to-Text
Transformers
Safetensors
internlm/CapRL-2M
English
qwen3_vl
image-to-text
multimodal
image caption
captioning
conversational
arxiv:
2509.22647
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
CapRL-Qwen3VL-2B
/
assets
16.7 MB
2 contributors
History:
1 commit
yuhangzang
Add files using upload-large-folder tool
8e50d31
verified
about 1 month ago
comparison.png
3.41 MB
xet
Add files using upload-large-folder tool
about 1 month ago
info_caprl.png
3.9 MB
xet
Add files using upload-large-folder tool
about 1 month ago
info_caprl2.png
2.85 MB
xet
Add files using upload-large-folder tool
about 1 month ago
natural_caprl.png
4.21 MB
xet
Add files using upload-large-folder tool
about 1 month ago
performance.png
Safe
147 kB
xet
Add files using upload-large-folder tool
about 1 month ago
performance_caprl2_0.png
136 kB
xet
Add files using upload-large-folder tool
about 1 month ago
performance_update.png
140 kB
xet
Add files using upload-large-folder tool
about 1 month ago
teaser.png
1.88 MB
xet
Add files using upload-large-folder tool
about 1 month ago