PyTorch
llama
File size: 1,486 Bytes
2ad02ca
 
 
e753070
2ad02ca
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
e753070
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
---
license: cc-by-nc-nd-4.0
---

# HASS Baseline Checkpoint (Used in PosS)

This repository provides a reproduced **HASS** model checkpoint that is used as a **baseline** in **PosS (Position Specialist)** experiments.

PosS is a speculative decoding method proposed in the paper:

> **PosS: Position Specialist Generates Better Draft for Speculative Decoding**

In our experiments, this HASS checkpoint serves as the baseline draft model for comparison with the proposed position-specialized draft models.

---

## πŸ”— Code

The full implementation of PosS, along with training details and evaluation scripts (including EAGLE-2 and HASS baselines), is available at:

πŸ‘‰ **GitHub:** https://github.com/shrango/PosS

---

## πŸ“¦ Files

If the model is not automatically downloaded by your framework, you may manually download the following files from this repository:

- `pytorch_model.bin` β€” model weights  
- `config.json` β€” model configuration

---

## πŸ“– Citation

If you use this checkpoint in the context of PosS or refer to the PosS method, please cite:

```bibtex
@misc{huang2025posspositionspecialistgenerates,
  title        = {POSS: Position Specialist Generates Better Draft for Speculative Decoding},
  author       = {Langlin Huang and Chengsong Huang and Jixuan Leng and Di Huang and Jiaxin Huang},
  year         = {2025},
  eprint       = {2506.03566},
  archivePrefix= {arXiv},
  primaryClass = {cs.CL},
  url          = {https://arxiv.org/abs/2506.03566}
}