lujangusface committed on
Commit fb4699b · verified · 1 Parent(s): 29cdd29

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +6 -6
README.md CHANGED
@@ -128,16 +128,16 @@ EAGLE3 trains a single-layer draft head that predicts the next token using hidde
 
  *Config: steps=3, topk=4, draft_tokens=8. 8x H200 (TP=4).*
 
- **With draft_tokens=6 (standard config, verified 2026-04-12)**:
+ **With draft_tokens=6 (verified 2026-04-12)**:
 
  | Dataset | Baseline (tok/s) | EAGLE3 (tok/s) | Speedup |
  |---------|-----------------|----------------|---------|
- | HumanEval | 109.6 | 158.0 | **1.44x** |
- | Terminal-Bench | 108.9 | 150.2 | **1.38x** |
- | MT-Bench | 109.0 | 143.6 | **1.32x** |
- | SWEBench-Verified | 109.1 | 116.5 | **1.07x** |
+ | HumanEval | 109.6 | 177.0 | **1.61x** |
+ | Terminal-Bench | 108.9 | 160.8 | **1.48x** |
+ | MT-Bench | 109.0 | 146.8 | **1.35x** |
+ | SWEBench-Verified | 109.1 | 123.1 | **1.13x** |
 
- *Config: steps=3, topk=4, draft_tokens=6. 4x H200 (TP=4). Server-side Prometheus metrics.*
+ *Config: steps=3, topk=4, draft_tokens=8. 4x H200 (TP=4). Server-side Prometheus metrics.*
 
  ## Model Architecture
 
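
The Speedup column in the diff above appears to be the EAGLE3 throughput divided by the baseline throughput, rounded to two decimals. A minimal sketch to recompute it from the `+` rows follows; the `rows` mapping and the `speedup` helper are illustrative names, not part of the repository, and the numbers are copied from the table rather than re-measured.

```python
# Throughput figures (tok/s) copied from the "+" rows of the README diff.
# Each entry is (baseline, eagle3). The names here are hypothetical helpers.
rows = {
    "HumanEval": (109.6, 177.0),
    "Terminal-Bench": (108.9, 160.8),
    "MT-Bench": (109.0, 146.8),
    "SWEBench-Verified": (109.1, 123.1),
}


def speedup(baseline: float, eagle3: float) -> float:
    """Assumed definition: ratio of EAGLE3 throughput to baseline throughput."""
    return eagle3 / baseline


# Round to two decimals to match the precision used in the table.
speedups = {name: round(speedup(b, e), 2) for name, (b, e) in rows.items()}

for name, value in speedups.items():
    print(f"{name}: {value:.2f}x")
```

Under that assumed definition, the recomputed ratios agree with the values committed in the `+` rows.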