Update README.md #4
opened by bachvudinh
README.md CHANGED
@@ -25,13 +25,13 @@ Building on this base, **Jan-Code**, a code-tuned variant, **will be released so
 
 ## Model Overview
 
-
-
--
-- Number of Parameters:
+> **Note:** Jan-v3-4B-base-instruct inherits its core architecture from **Qwen/Qwen3-4B-Instruct-2507**.
+
+- Number of Parameters: 4.0B
+- Number of Parameters (Non-Embedding): 3.6B
 - Number of Layers: 36
 - Number of Attention Heads (GQA): 32 for Q and 8 for KV
-- Context Length: **262,144 natively**.
+- Context Length: **262,144 natively**.
 
 **Intended Use**
 
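The figures added to the Model Overview in this change can be sanity-checked with a little arithmetic. The sketch below is illustrative only: the constant and function names are hypothetical, the values are copied from the diff as rounded figures, and nothing here reads the real model config.

```python
# Values copied from the Model Overview lines in this diff (rounded figures).
TOTAL_PARAMS_B = 4.0           # "Number of Parameters: 4.0B"
NON_EMBEDDING_PARAMS_B = 3.6   # "Number of Parameters (Non-Embedding): 3.6B"
NUM_Q_HEADS = 32               # GQA query heads
NUM_KV_HEADS = 8               # GQA key/value heads
CONTEXT_LENGTH = 262_144       # native context length in tokens


def embedding_params_b(total_b: float, non_embedding_b: float) -> float:
    """Approximate embedding parameter count, in billions, from rounded totals."""
    return round(total_b - non_embedding_b, 1)


def gqa_group_size(num_q_heads: int, num_kv_heads: int) -> int:
    """Number of query heads that share one key/value head under GQA."""
    if num_q_heads % num_kv_heads != 0:
        raise ValueError("query heads must be a multiple of key/value heads")
    return num_q_heads // num_kv_heads


if __name__ == "__main__":
    print("embedding params (B, approx):",
          embedding_params_b(TOTAL_PARAMS_B, NON_EMBEDDING_PARAMS_B))
    print("query heads per KV head:", gqa_group_size(NUM_Q_HEADS, NUM_KV_HEADS))
    print("context length is 2**18:", CONTEXT_LENGTH == 2 ** 18)
```

Because the parameter counts are rounded to one decimal place, the embedding figure is only approximate. The head ratio means four query heads share each key/value head.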