Update README.md
Browse files
README.md
CHANGED
|
@@ -68,7 +68,8 @@ Per un'azienda: **un solo Genome enterprise** + Soul intercambiabili per ogni re
|
|
| 68 |
| **V1** | 15,000 | 1.07 | 1.5GB curated | Baseline — `zeta25b_step15000.pt` |
|
| 69 |
| **V2 Repair** | 500 | 1.07 | 1.4GB SFT mix | Fine-tuning da V1 |
|
| 70 |
| **V3 Knowledge** | 5,000 | 1.33 | **9.1GB** (Wiki EN/IT, C4, Books) | Espansione conoscenza |
|
| 71 |
-
| **V4 Expanded** | 10,000 |
|
|
|
|
| 72 |
|
| 73 |
- Hardware: **NVIDIA A40 48GB** (singola GPU)
|
| 74 |
- Architettura: Fractal Gated Causal TCN, No-Attention
|
|
@@ -135,8 +136,8 @@ DOI: `10.5281/zenodo.18622610`
|
|
| 135 |
- [x] V1: Genome + Soul baseline (15K step, loss 1.07)
|
| 136 |
- [x] V2: Fine-tuning repair
|
| 137 |
- [x] V3: Knowledge expansion (9.1GB dataset, Wiki EN/IT, C4, Books)
|
| 138 |
-
- [
|
| 139 |
-
- [
|
| 140 |
- [ ] GGUF v2: Conversione e upload del checkpoint V4
|
| 141 |
- [ ] Scaling 50B: Stesso Genome, Soul espansa a 64 layer
|
| 142 |
- [ ] Scaling 1T: Dimostrazione proof-of-concept
|
|
|
|
| 68 |
| **V1** | 15,000 | 1.07 | 1.5GB curated | Baseline — `zeta25b_step15000.pt` |
|
| 69 |
| **V2 Repair** | 500 | 1.07 | 1.4GB SFT mix | Fine-tuning da V1 |
|
| 70 |
| **V3 Knowledge** | 5,000 | 1.33 | **9.1GB** (Wiki EN/IT, C4, Books) | Espansione conoscenza |
|
| 71 |
+
| **V4 Expanded** | 10,000 | **1.28** | 9.1GB | LoRA rank 512 (~950M trainabili) ✅ |
|
| 72 |
+
| **V5 Code** | 5,000 | In training | 4.5GB code | Code Specialist Soul 🔄 |
|
| 73 |
|
| 74 |
- Hardware: **NVIDIA A40 48GB** (singola GPU)
|
| 75 |
- Architettura: Fractal Gated Causal TCN, No-Attention
|
|
|
|
| 136 |
- [x] V1: Genome + Soul baseline (15K step, loss 1.07)
|
| 137 |
- [x] V2: Fine-tuning repair
|
| 138 |
- [x] V3: Knowledge expansion (9.1GB dataset, Wiki EN/IT, C4, Books)
|
| 139 |
+
- [x] V4: LoRA rank 512 (~950M trainabili) — **loss 1.28, PPL 3.6** ✅
|
| 140 |
+
- [/] V5 Code: Code Specialist Soul (4.5GB code dataset) — **in training ora**
|
| 141 |
- [ ] GGUF v2: Conversione e upload del checkpoint V4
|
| 142 |
- [ ] Scaling 50B: Stesso Genome, Soul espansa a 64 layer
|
| 143 |
- [ ] Scaling 1T: Dimostrazione proof-of-concept
|