Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
internlm
/
CapRL-3B
like
46
Follow
Intern Large Models
961
Image-Text-to-Text
Transformers
Safetensors
internlm/CapRL-2M
English
qwen2_5_vl
multimodal
image caption
captioning
conversational
text-generation-inference
arxiv:
2509.22647
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
yuhangzang
commited on
Sep 25, 2025
Commit
d2b5e1c
·
verified
·
1 Parent(s):
b42cb54
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+1
-0
README.md
CHANGED
Viewed
@@ -32,3 +32,4 @@ filtered 75K QA dataset as the training set, we obtained a highly capable captio
32
33
## Cases
34
32
33
## Cases
34
35
+
## Evaluation