Update README.md
Browse files
README.md
CHANGED
|
@@ -39,9 +39,9 @@ We compared the text-only checkpoint against the original Gemma 4 E2B-it across
|
|
| 39 |
|
| 40 |
| Metric | Gemma 4 E2B-it | Text-Only | Reduction |
|
| 41 |
| --- | --- | --- | --- |
|
| 42 |
-
| VRAM (MiB) |
|
| 43 |
| Parameters (B) | 5.12 | 4.65 | ~9% |
|
| 44 |
-
| File size (GB) |
|
| 45 |
|
| 46 |
> **Note:** The "E" in E2B stands for "effective" parameters. The Gemma 4 E2B architecture uses Per-Layer Embeddings (PLE) to maximize parameter efficiency on-device — the total parameter count is higher than the effective size.
|
| 47 |
> The text-only variant removes the vision and audio encoder weights while preserving the full language model, including all PLE parameters.
|
|
|
|
| 39 |
|
| 40 |
| Metric | Gemma 4 E2B-it | Text-Only | Reduction |
|
| 41 |
| --- | --- | --- | --- |
|
| 42 |
+
| VRAM (MiB) | 10,504 | 9,596 | ~9% |
|
| 43 |
| Parameters (B) | 5.12 | 4.65 | ~9% |
|
| 44 |
+
| File size (GB) | 10.20 | 9.29 | ~9% |
|
| 45 |
|
| 46 |
> **Note:** The "E" in E2B stands for "effective" parameters. The Gemma 4 E2B architecture uses Per-Layer Embeddings (PLE) to maximize parameter efficiency on-device — the total parameter count is higher than the effective size.
|
| 47 |
> The text-only variant removes the vision and audio encoder weights while preserving the full language model, including all PLE parameters.
|