Update model card with full architecture and training details 051c2da verified rcgalbo committed on about 23 hours ago
Upload pruned Aetheris (536M params, 80K vocab, 25.7% smaller) 2f57a9e verified rcgalbo committed on 4 days ago
Upload Aetheris model (Stage 2 best, 722M params, loss=2.73) 8b21693 verified rcgalbo committed on 4 days ago
Stage 1 complete: 10K steps, CKA layer alignment final checkpoint ba9c5e2 verified rcgalbo committed on 5 days ago
Stage 1 checkpoint: [Step 9100/10000] loss=0.0141 cka_mean=0.1489 8f808fd verified rcgalbo committed on 5 days ago
Stage 1 checkpoint: [Step 8400/10000] loss=0.0131 cka_mean=0.1052 f5ddb2a verified rcgalbo committed on 5 days ago
Stage 1 checkpoint: [Step 7100/10000] loss=0.0161 cka_mean=0.1370 eb32e03 verified rcgalbo committed on 5 days ago
Stage 1 checkpoint: [Step 6400/10000] loss=0.0199 cka_mean=0.3752 7a10927 verified rcgalbo committed on 5 days ago
Stage 1 checkpoint: [Step 5050/10000] loss=0.0500 cka_mean=0.4527 4aef24f verified rcgalbo committed on 5 days ago
Stage 1 checkpoint: [Step 4400/10000] loss=0.6573 cka_mean=0.3674 9dd17c0 verified rcgalbo committed on 5 days ago
docs: comprehensive model card with Space link and benchmarks c86da78 verified rcgalbo committed on 5 days ago
Stage 1 checkpoint: [Step 3100/10000] loss=0.0432 cka_mean=0.4536 9fb18f0 verified rcgalbo committed on 5 days ago
Stage 1 checkpoint: [Step 2450/10000] loss=0.0617 cka_mean=0.4075 7efdd2b verified rcgalbo committed on 5 days ago
Stage 1 checkpoint: [Step 1800/10000] loss=0.1092 cka_mean=0.4199 bd5a12a verified rcgalbo committed on 5 days ago