Upload folder using huggingface_hub

Files changed (1) hide show

README.md CHANGED Viewed

@@ -38,11 +38,6 @@ Built on top of MOSS-VL-Base-0408 through supervised fine-tuning (SFT), this che
 - 🖼️ **Strong General Multimodal Perception** — Robust image understanding, fine-grained object recognition, OCR, and document parsing.
 - 💬 **Reliable Instruction Following** — Substantially improved alignment with user intent through supervised fine-tuning on diverse multimodal instruction data.
-### 📝 Note on Variants
-> [!IMPORTANT]
-> **This is the offline instruction-tuned checkpoint.** It is not the streaming variant. If you are looking for low-latency, real-time interactive video understanding, please refer to the upcoming **MOSS-VL-RealTime** release.
 ---

 - 🖼️ **Strong General Multimodal Perception** — Robust image understanding, fine-grained object recognition, OCR, and document parsing.
 - 💬 **Reliable Instruction Following** — Substantially improved alignment with user intent through supervised fine-tuning on diverse multimodal instruction data.
 ---