CCCCyx committed on
Commit db9bcfa · verified · 1 Parent(s): 961831a

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -45,7 +45,7 @@ Built through four stages of multimodal pretraining only, this checkpoint serves
 
 ## 🏗 Model Architecture
 
-**MOSS-VL-Base-0408** adopts a cross-attention-based architecture that decouples visual encoding from cognitive reasoning. Natively supporting interleaved modalities, it provides a flexible multimodal backbone for image and video understanding while preserving a clean foundation for downstream alignment and adaptation.
+**MOSS-VL-Base-0408** adopts a cross-attention-based architecture that decouples visual encoding from cognitive reasoning. Natively supporting interleaved modalities, it provides a multimodal backbone for image and video understanding.
 
 <p align="center">
 <img src="assets/structure.png" alt="MOSS-VL Architecture" width="90%"/>