XLOverflow/qwen3-eagle3-adaspec

@@ -18,6 +18,8 @@ Part of a course project evaluating per-step weighted loss functions for trainin
 EAGLE3 draft models. Full pipeline and source:
 **https://github.com/XLOverflow/anlp_course_project**
 ## Training
 - **Framework:** [SpecForge](https://github.com/sgl-project/SpecForge) (our fork: https://github.com/XLOverflow/SpecForge)
@@ -41,10 +43,9 @@ Baselines for reference: Vanilla ≈ 1× speedup, EAGLE-orig ≈ 2× speedup.
 - `model.safetensors` — draft model weights (~763 MB)
 - `config.json` — model config
-- Checkpoint corresponds to: `outputs/eagle3-adaspec/epoch_4_step_84000` in the original training output
-Optimizer state (`training_state.pt`, ~3 GB) is not uploaded — use the project
-repo's training scripts to resume from scratch if needed.
 ## Usage

 EAGLE3 draft models. Full pipeline and source:
 **https://github.com/XLOverflow/anlp_course_project**
+Collection: [Qwen3 EAGLE3 — Weighted Loss Variants](https://huggingface.co/collections/XLOverflow/qwen3-eagle3-weighted-loss-variants)
 ## Training
 - **Framework:** [SpecForge](https://github.com/sgl-project/SpecForge) (our fork: https://github.com/XLOverflow/SpecForge)
 - `model.safetensors` — draft model weights (~763 MB)
 - `config.json` — model config
+- Corresponds to: `outputs/eagle3-adaspec/epoch_0_step_17026` in the original training output
+Optimizer state (~3 GB) is not uploaded — use the project repo's training scripts to resume from scratch if needed.
 ## Usage