maxholsman committed on
Commit
86ac9f6
·
verified ·
1 Parent(s): 2a51a8b

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +20 -0
README.md CHANGED
@@ -6,6 +6,8 @@ license: apache-2.0
6
 
7
  Custom generate function for fuzzy speculative decoding with support for KL divergence, Jensen-Shannon divergence, and draft token-based acceptance criteria. This implementation extends the standard speculative decoding algorithm with additional divergence metrics for more flexible candidate acceptance.
8
 
9
  ## Features
10
 
11
  - **Fuzzy Speculative Decoding (FSD)**: Accepts candidate tokens based on distribution divergence thresholds
@@ -81,6 +83,24 @@ print(generated_text)
81
  - `transformers>=4.40.0`
82
  - `scikit-learn` (optional, for confidence threshold features)
83
 
84
  ## License
85
 
86
  Apache 2.0
 
6
 
7
  Custom generate function for fuzzy speculative decoding with support for KL divergence, Jensen-Shannon divergence, and draft token-based acceptance criteria. This implementation extends the standard speculative decoding algorithm with additional divergence metrics for more flexible candidate acceptance.
8
 
9
+ This implementation is based on the paper **"Fuzzy Speculative Decoding for a Tunable Accuracy-Runtime Tradeoff"** (ACL Findings 2025). See the [Citation](#citation) section below for full citation details.
10
+
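The divergence-threshold acceptance described above can be sketched in a few lines. This is a minimal illustration, not the repository's implementation: the function names (`kl_divergence`, `js_divergence`, `count_accepted`), the direction of the KL term, and the example threshold are all assumptions made for the example.

```python
import math

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions given as lists of probabilities."""
    # Terms with p_i == 0 contribute nothing; eps guards against q_i == 0.
    return sum(pi * math.log(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0)

def js_divergence(p, q):
    """Jensen-Shannon divergence: symmetric and bounded by ln(2)."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)

def count_accepted(draft_dists, target_dists, threshold, divergence=js_divergence):
    """Hypothetical helper: count leading draft positions whose divergence
    from the target distribution stays at or below the threshold."""
    accepted = 0
    for q, p in zip(draft_dists, target_dists):
        if divergence(p, q) > threshold:
            break  # first rejection ends the accepted prefix
        accepted += 1
    return accepted

# Toy next-token distributions over a two-token vocabulary (illustrative only).
draft = [[0.5, 0.5], [0.9, 0.1], [0.1, 0.9]]
target = [[0.5, 0.5], [0.8, 0.2], [0.9, 0.1]]
print(count_accepted(draft, target, threshold=0.05))
```

Under these assumptions, raising the threshold accepts more draft tokens per step (faster, but further from the target model's output), while lowering it does the opposite.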
11
  ## Features
12
 
13
  - **Fuzzy Speculative Decoding (FSD)**: Accepts candidate tokens based on distribution divergence thresholds
 
83
  - `transformers>=4.40.0`
84
  - `scikit-learn` (optional, for confidence threshold features)
85
 
86
+ ## Citation
87
+
88
+ If you use this code in your research, please cite the original paper:
89
+
90
+ ```bibtex
91
+ @article{holsman2025fuzzy,
92
+ title={Fuzzy Speculative Decoding for a Tunable Accuracy-Runtime Tradeoff},
93
+ author={Holsman, Maximilian and Huang, Yukun and Dhingra, Bhuwan},
94
+ journal={ACL Findings},
95
+ year={2025},
96
+ url={https://arxiv.org/abs/2502.20704}
97
+ }
98
+ ```
99
+
100
+ - **Paper**: [Fuzzy Speculative Decoding for a Tunable Accuracy-Runtime Tradeoff](https://arxiv.org/abs/2502.20704)
101
+ - **Authors**: Maximilian Holsman, Yukun Huang, Bhuwan Dhingra
102
+ - **Venue**: ACL Findings 2025
103
+
104
  ## License
105
 
106
  Apache 2.0