Add model card for TAPS draft model

#1
by nielsr HF Staff - opened
Files changed (1)
  1. README.md +32 -1
README.md CHANGED
@@ -1 +1,32 @@
- arxiv.org/abs/2603.27027
+ ---
+ license: apache-2.0
+ library_name: transformers
+ pipeline_tag: text-generation
+ ---
+
+ # TAPS: Task-Aware Proposal Distributions for Speculative Sampling
+
+ This repository contains a lightweight draft model introduced in the paper [TAPS: Task Aware Proposal Distributions for Speculative Sampling](https://huggingface.co/papers/2603.27027).
+
+ ## Overview
+ Speculative decoding accelerates autoregressive generation by letting a lightweight draft model propose future tokens that a larger target model then verifies in parallel. TAPS (Task-Aware Proposal Distributions) investigates how the draft training distribution shapes speculative decoding quality.
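The draft-then-verify loop described above can be illustrated with a minimal sketch. It is not the TAPS implementation: `target_next` and `draft_next` are hypothetical toy stand-ins for the verifier and the drafter, and the sketch uses the simpler greedy acceptance rule (keep drafted tokens while they match the target's own greedy choice) rather than the stochastic accept/reject rule of speculative sampling.

```python
from typing import Callable, List, Tuple

# Hypothetical toy "models": each maps a token prefix to its greedy next token.
NextToken = Callable[[List[int]], int]


def target_next(prefix: List[int]) -> int:
    # Stand-in for the large verifier: counts upward modulo 10.
    return (prefix[-1] + 1) % 10


def draft_next(prefix: List[int]) -> int:
    # Stand-in for the small drafter: agrees with the target except after a 4.
    return 0 if prefix[-1] == 4 else (prefix[-1] + 1) % 10


def speculative_step(
    prefix: List[int], draft: NextToken, target: NextToken, k: int = 4
) -> Tuple[List[int], int]:
    """Run one draft-then-verify round.

    Returns the tokens to append and how many drafted tokens were accepted.
    """
    # 1. Draft k tokens autoregressively with the cheap model.
    proposal: List[int] = []
    for _ in range(k):
        proposal.append(draft(prefix + proposal))

    # 2. Verify. A real system scores all k positions in one target forward
    #    pass; this toy version simply loops over them.
    accepted: List[int] = []
    for tok in proposal:
        expected = target(prefix + accepted)
        if tok != expected:
            # First mismatch: keep the target's token and discard the rest.
            accepted.append(expected)
            return accepted, len(accepted) - 1
        accepted.append(tok)

    # 3. Every drafted token matched, so the target adds one extra token.
    accepted.append(target(prefix + accepted))
    return accepted, k


tokens, n_accepted = speculative_step([1], draft_next, target_next, k=4)
print(tokens, n_accepted)
```

In this toy setting the drafter diverges from the target after token 4, so three of the four drafted tokens are accepted and the target supplies the fourth. The accepted length per round is the quantity a better-matched draft distribution is meant to increase.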
+
+ The paper reports two main findings:
+ - **Task-aware specialization:** Drafts trained on a specific dataset (such as MathInstruct or ShareGPT) perform significantly better on domain-matched workloads.
+ - **Composition strategies:** The paper explores combining specialized drafters at inference time via confidence-based routing and merged-tree verification, which outperform naive weight averaging.
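As a rough illustration of the routing idea in the list above, the sketch below picks, for each round, the drafter whose proposal it is most confident in. The confidence score used here (mean of the top probability at each drafted position), the function names, and the example numbers are all assumptions made for illustration; the paper's actual routing criterion may differ.

```python
from typing import Dict, List, Sequence, Tuple

# A proposal is (drafted token ids, per-position probability distributions).
Proposal = Tuple[List[int], List[List[float]]]


def mean_confidence(step_probs: Sequence[Sequence[float]]) -> float:
    """Average of the highest probability at each drafted position (assumed score)."""
    return sum(max(p) for p in step_probs) / len(step_probs)


def route_by_confidence(proposals: Dict[str, Proposal]) -> str:
    """Return the name of the drafter whose proposal has the highest mean confidence."""
    return max(proposals, key=lambda name: mean_confidence(proposals[name][1]))


# Hypothetical proposals from two specialized drafters for the same prefix.
proposals: Dict[str, Proposal] = {
    "math": ([11, 12], [[0.9, 0.1], [0.8, 0.2]]),
    "chat": ([21, 22], [[0.6, 0.4], [0.55, 0.45]]),
}
chosen = route_by_confidence(proposals)
print(chosen)
```

Only the chosen drafter's tokens would then be passed to the verifier. Merged-tree verification, by contrast, would send candidates from several drafters to the verifier together, which this sketch does not attempt to model.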
+
+ This model is a lightweight (~0.8B parameters) Llama-style drafter designed to be paired with a larger verifier model, such as Meta-Llama-3-8B-Instruct.
+
+ ## Resources
+ - **Paper:** [https://arxiv.org/abs/2603.27027](https://arxiv.org/abs/2603.27027)
+ - **GitHub Repository:** [https://github.com/Moe-Zbeeb/TAPS](https://github.com/Moe-Zbeeb/TAPS)
+
+ ## Citation
+ ```bibtex
+ @article{zbib2026taps,
+   title={TAPS: Task Aware Proposal Distributions for Speculative Sampling},
+   author={Zbib, Mohamad and Bazzi, Mohamad and Mohanna, Ammar and Ghanem, Bernard and Hammoud, Hasan Abed Al Kader},
+   year={2026},
+   note={Technical report}
+ }
+ ```