abcsk123 commited on
Commit
2f654b9
·
verified ·
1 Parent(s): ce5016a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -14,7 +14,7 @@ language:
14
  - zh
15
  ---
16
 
17
- # Qwen2.5-Coder-1.5B-Hybrid-v9
18
 
19
  ## 🌟 Model Overview
20
  This is a custom-architected model based on `Qwen2.5-Coder-1.5B`. We introduced a novel **Asymmetric Hybrid Architecture (GQA + MLA)** with **Cross-Layer Shared Latent Gates** and **Attention Sinks**, enabling efficient feature communication and reduced KV-Cache memory footprint.
 
14
  - zh
15
  ---
16
 
17
+ # PyraCode-1.5B
18
 
19
  ## 🌟 Model Overview
20
  This is a custom-architected model based on `Qwen2.5-Coder-1.5B`. We introduced a novel **Asymmetric Hybrid Architecture (GQA + MLA)** with **Cross-Layer Shared Latent Gates** and **Attention Sinks**, enabling efficient feature communication and reduced KV-Cache memory footprint.