Trained from scratch on mixed data with dual-alpha distillation, where each data stream uses its own distillation weight (alpha):
| Stream | Dataset | Alpha | Purpose |
|---|---|---|---|
| General | FineWeb-Edu | 0.2 | Language modeling, light teacher guidance |
| Reasoning | GSM8K chain-of-thought | 0.8 | Heavy distillation: teacher guides step-by-step math reasoning |
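
One way to read the table is as a per-stream blend of a language-modeling loss and a teacher-matching loss. The sketch below is an illustration under stated assumptions, not the training code used here: it assumes alpha weights the distillation term (KL divergence from teacher to student) and `1 - alpha` weights the ordinary cross-entropy term, with no temperature scaling. The names `STREAM_ALPHA` and `distill_loss` are hypothetical.

```python
import math

# Per-stream distillation weights, copied from the table above.
STREAM_ALPHA = {"general": 0.2, "reasoning": 0.8}


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def distill_loss(student_logits, teacher_logits, target, stream):
    """Per-token loss for one stream (assumed form, not confirmed by the source):

        (1 - alpha) * CE(student, target) + alpha * KL(teacher || student)
    """
    alpha = STREAM_ALPHA[stream]
    s_logp = log_softmax(student_logits)
    t_logp = log_softmax(teacher_logits)

    # Cross-entropy of the student against the ground-truth token.
    ce = -s_logp[target]
    # KL divergence from the teacher distribution to the student distribution.
    kl = sum(math.exp(t) * (t - s) for t, s in zip(t_logp, s_logp))

    return (1.0 - alpha) * ce + alpha * kl


if __name__ == "__main__":
    student = [1.0, 0.2, -0.5]
    teacher = [2.0, 0.1, -1.0]
    for stream in STREAM_ALPHA:
        print(stream, distill_loss(student, teacher, target=0, stream=stream))
```

Under this reading, a FineWeb-Edu batch is dominated by the ground-truth next-token loss, while a GSM8K batch is dominated by how closely the student matches the teacher's distribution at each reasoning step.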