internlm/CapRL-InternVL3.5-8B

Image-Text-to-Text


yuhangzang committed on Oct 16, 2025

Commit f5e00bb · verified · 1 Parent(s): c1fdc60

Update README.md

Files changed (1)

README.md +4 -3


@@ -18,12 +18,13 @@ tags:
   🤗<a href="https://huggingface.co/collections/long-xing1/caprl-68d64ac32ded31596c36e189">CapRL Collection</a> | 🤗<a href="https://huggingface.co/papers/2509.22647">Daily Paper</a>
-Based on the same recipe as CapRL-3B, we used InternVL3.5-8B as the policy model and obtained **CapRL-InternVL3.5-8B** through CapRL. **Its performance significantly surpasses that of Qwen2.5-VL-72B**.
-We are working on even stronger base models and upgrading our training recipe — stay tuned!
-CapRL-3B-GGUF is static quants version, and CapRL-3B-i1-GGUF is weighted/imatrix quants version. Thanks for their contribution!
+📢 News
+- 🚀 Based on the same recipe as CapRL-3B, we used InternVL3.5-8B as the policy model and obtained **CapRL-InternVL3.5-8B** through CapRL. **Its performance significantly surpasses that of Qwen2.5-VL-72B**.
+- 🚀 We are working on even stronger base models and upgrading our training recipe — stay tuned!
+- 🚀 CapRL-3B-GGUF is static quants version, and CapRL-3B-i1-GGUF is weighted/imatrix quants version. Thanks for their contribution!
 ## Introduction
 We are excited to introduce CapRL-3B, a lightweight 3B image captioner that achieves perception capabilities comparable to Qwen2.5-VL-72B.