FoolDev commited on
Commit
c843f11
·
verified ·
1 Parent(s): 2c48e37

docs: animated banner.svg + dense forward-pass diagram in Architecture

Browse files
Files changed (1) hide show
  1. README.md +5 -1
README.md CHANGED
@@ -44,7 +44,7 @@ library_name: transformers
44
  pipeline_tag: image-text-to-text
45
  ---
46
 
47
- <img src="https://huggingface.co/FoolDev/janus-27b/resolve/main/banner.png" alt="Janus-27B banner" width="100%" />
48
 
49
  [![License](https://img.shields.io/badge/License-Apache_2.0-7aa2f7?style=flat&labelColor=1a1b26)](https://opensource.org/licenses/Apache-2.0)
50
  [![Base Model](https://img.shields.io/badge/Base-Qwen3.6--27B-bb9af7?style=flat&labelColor=1a1b26)](https://huggingface.co/Qwen/Qwen3.6-27B)
@@ -151,6 +151,10 @@ If you want the safetensors for `transformers`, fetch them from [`Qwen/Qwen3.6-2
151
 
152
  ## Architecture
153
 
 
 
 
 
154
  - Qwen 3.6 dense, 27B parameters, 64 transformer layers
155
  - **Hybrid attention stack**: 16 repeats of `[3 × (Gated DeltaNet → FFN) → 1 × (Gated Attention → FFN)]`
156
  - Gated DeltaNet (linear attention): 48 V-heads, 16 QK-heads, head_dim 128
 
44
  pipeline_tag: image-text-to-text
45
  ---
46
 
47
+ <img src="https://huggingface.co/FoolDev/janus-27b/resolve/main/banner.svg" alt="Janus-27B banner" width="100%" />
48
 
49
  [![License](https://img.shields.io/badge/License-Apache_2.0-7aa2f7?style=flat&labelColor=1a1b26)](https://opensource.org/licenses/Apache-2.0)
50
  [![Base Model](https://img.shields.io/badge/Base-Qwen3.6--27B-bb9af7?style=flat&labelColor=1a1b26)](https://huggingface.co/Qwen/Qwen3.6-27B)
 
151
 
152
  ## Architecture
153
 
154
+ <p align="left">
155
+ <img src="https://huggingface.co/FoolDev/janus-27b/resolve/main/dense-flow.svg" alt="animated dense forward-pass visualization: 64-layer hybrid attention stack with a pulse traversing left-to-right, illuminating Gated DeltaNet (purple) and Gated Attention (cyan) layers in turn" width="800" />
156
+ </p>
157
+
158
  - Qwen 3.6 dense, 27B parameters, 64 transformer layers
159
  - **Hybrid attention stack**: 16 repeats of `[3 × (Gated DeltaNet → FFN) → 1 × (Gated Attention → FFN)]`
160
  - Gated DeltaNet (linear attention): 48 V-heads, 16 QK-heads, head_dim 128