ostapeno commited on
Commit
7012e86
·
1 Parent(s): 8925af8

report link

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -23,7 +23,7 @@ A 15B-parameter **token-mixer supernet** with **8 optimized deployment presets**
23
  - **Instruction-tuned**: targeted SFT with multiple Pareto-optimal placements
24
  - **Speculative decoding support**: use all-attention as target with efficient placements as drafts from the same checkpoint
25
 
26
- See the report for detailed benchmarks, quality retention curves, and the full story.
27
 
28
  ## Performance Overview
29
 
 
23
  - **Instruction-tuned**: targeted SFT with multiple Pareto-optimal placements
24
  - **Speculative decoding support**: use all-attention as target with efficient placements as drafts from the same checkpoint
25
 
26
+ See the [report](https://arxiv.org/abs/2604.19877) for detailed benchmarks, quality retention curves, and the full story.
27
 
28
  ## Performance Overview
29