abcsk123 commited on
Commit
4c6edb8
·
verified ·
1 Parent(s): 86c7b2f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -25,8 +25,8 @@ tags:
25
  This is a custom-architected model based on `Qwen2.5-Coder-1.5B`. We introduced a novel **Asymmetric Hybrid Architecture (GQA + MLA)** with **Cross-Layer Shared Latent Gates** and **Attention Sinks**, enabling efficient feature communication and reduced KV-Cache memory footprint.
26
 
27
  ## 🏗️ Architecture Innovations
28
- *(这里插入你用 picture.py 生成的架构图,可以把图片拖进 Hugging Face 网页版的编辑框里自动生成链接)*
29
- ![Hybrid Architecture](填入你的图片链接)
30
 
31
  Unlike standard Qwen2 models, this `Hybrid-v9` backbone features:
32
  1. **Asymmetric Layers:**
 
25
  This is a custom-architected model based on `Qwen2.5-Coder-1.5B`. We introduced a novel **Asymmetric Hybrid Architecture (GQA + MLA)** with **Cross-Layer Shared Latent Gates** and **Attention Sinks**, enabling efficient feature communication and reduced KV-Cache memory footprint.
26
 
27
  ## 🏗️ Architecture Innovations
28
+ ![Uploading qwen2_hybrid_arch.png…]()
29
+
30
 
31
  Unlike standard Qwen2 models, this `Hybrid-v9` backbone features:
32
  1. **Asymmetric Layers:**