Update EmberNet Stage 2 Epoch 3/5 | loss 4.1517 | step 1875
Browse files- README.md +5 -5
- pytorch_model.bin +1 -1
README.md
CHANGED
|
@@ -13,7 +13,7 @@ pipeline_tag: image-text-to-text
|
|
| 13 |
|
| 14 |
# EmberNet — BitNet b1.58 MoE VLM
|
| 15 |
|
| 16 |
-
> **Status:** Stage 2/2, Epoch
|
| 17 |
|
| 18 |
EmberNet is a tiny but capable Vision-Language Model built for edge deployment
|
| 19 |
and domain-expert reasoning. It combines a frozen **SigLIP** vision backbone
|
|
@@ -32,11 +32,11 @@ preserving strong visual understanding across 8 specialised domains.
|
|
| 32 |
| **Total parameters** | 840.8 M |
|
| 33 |
| **Trainable parameters** | 723.3 M |
|
| 34 |
| **Active parameters / forward** | ~235.4 M (top-2 routing) |
|
| 35 |
-
| **Carbon footprint** | 0.
|
| 36 |
| **Training stage** | Stage 2/2 — Expert SFT |
|
| 37 |
-
| **Epoch** |
|
| 38 |
-
| **Best loss** | 4.
|
| 39 |
-
| **Last updated** | 2026-03-08
|
| 40 |
|
| 41 |
---
|
| 42 |
|
|
|
|
| 13 |
|
| 14 |
# EmberNet — BitNet b1.58 MoE VLM
|
| 15 |
|
| 16 |
+
> **Status:** Stage 2/2, Epoch 3/5, Loss 4.1517
|
| 17 |
|
| 18 |
EmberNet is a tiny but capable Vision-Language Model built for edge deployment
|
| 19 |
and domain-expert reasoning. It combines a frozen **SigLIP** vision backbone
|
|
|
|
| 32 |
| **Total parameters** | 840.8 M |
|
| 33 |
| **Trainable parameters** | 723.3 M |
|
| 34 |
| **Active parameters / forward** | ~235.4 M (top-2 routing) |
|
| 35 |
+
| **Carbon footprint** | 0.6390 kg CO₂eq |
|
| 36 |
| **Training stage** | Stage 2/2 — Expert SFT |
|
| 37 |
+
| **Epoch** | 3/5 |
|
| 38 |
+
| **Best loss** | 4.1517 |
|
| 39 |
+
| **Last updated** | 2026-03-08 06:05 UTC |
|
| 40 |
|
| 41 |
---
|
| 42 |
|
pytorch_model.bin
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 3397346561
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:cf4d2c2a29d7ccced5560c8744219073d735148f900f301966665f363b5038ef
|
| 3 |
size 3397346561
|