RthItalia commited on
Commit
b8c657c
·
verified ·
1 Parent(s): ebfb4a2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -3
README.md CHANGED
@@ -68,7 +68,8 @@ Per un'azienda: **un solo Genome enterprise** + Soul intercambiabili per ogni re
68
  | **V1** | 15,000 | 1.07 | 1.5GB curated | Baseline — `zeta25b_step15000.pt` |
69
  | **V2 Repair** | 500 | 1.07 | 1.4GB SFT mix | Fine-tuning da V1 |
70
  | **V3 Knowledge** | 5,000 | 1.33 | **9.1GB** (Wiki EN/IT, C4, Books) | Espansione conoscenza |
71
- | **V4 Expanded** | 10,000 | In corso | 9.1GB | LoRA rank 512 (~950M trainabili) |
 
72
 
73
  - Hardware: **NVIDIA A40 48GB** (singola GPU)
74
  - Architettura: Fractal Gated Causal TCN, No-Attention
@@ -135,8 +136,8 @@ DOI: `10.5281/zenodo.18622610`
135
  - [x] V1: Genome + Soul baseline (15K step, loss 1.07)
136
  - [x] V2: Fine-tuning repair
137
  - [x] V3: Knowledge expansion (9.1GB dataset, Wiki EN/IT, C4, Books)
138
- - [/] V4: LoRA rank 512 (~950M trainabili) — **in training ora**
139
- - [ ] V4 Code-Only: Fork specializzato codice
140
  - [ ] GGUF v2: Conversione e upload del checkpoint V4
141
  - [ ] Scaling 50B: Stesso Genome, Soul espansa a 64 layer
142
  - [ ] Scaling 1T: Dimostrazione proof-of-concept
 
68
  | **V1** | 15,000 | 1.07 | 1.5GB curated | Baseline — `zeta25b_step15000.pt` |
69
  | **V2 Repair** | 500 | 1.07 | 1.4GB SFT mix | Fine-tuning da V1 |
70
  | **V3 Knowledge** | 5,000 | 1.33 | **9.1GB** (Wiki EN/IT, C4, Books) | Espansione conoscenza |
71
+ | **V4 Expanded** | 10,000 | **1.28** | 9.1GB | LoRA rank 512 (~950M trainabili) ✅ |
72
+ | **V5 Code** | 5,000 | In training | 4.5GB code | Code Specialist Soul 🔄 |
73
 
74
  - Hardware: **NVIDIA A40 48GB** (singola GPU)
75
  - Architettura: Fractal Gated Causal TCN, No-Attention
 
136
  - [x] V1: Genome + Soul baseline (15K step, loss 1.07)
137
  - [x] V2: Fine-tuning repair
138
  - [x] V3: Knowledge expansion (9.1GB dataset, Wiki EN/IT, C4, Books)
139
+ - [x] V4: LoRA rank 512 (~950M trainabili) — **loss 1.28, PPL 3.6** ✅
140
+ - [/] V5 Code: Code Specialist Soul (4.5GB code dataset) — **in training ora**
141
  - [ ] GGUF v2: Conversione e upload del checkpoint V4
142
  - [ ] Scaling 50B: Stesso Genome, Soul espansa a 64 layer
143
  - [ ] Scaling 1T: Dimostrazione proof-of-concept