File size: 1,165 Bytes
abe0765
b62c268
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
b41f33a
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
Companion artifact for [_GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization_](https://arxiv.org/abs/2605.31464). Code: [codezakh/gpu-surrogates](https://github.com/codezakh/gpu-surrogates).

LoRA adapter for [`openai/gpt-oss-20b`](https://huggingface.co/openai/gpt-oss-20b)
fine-tuned with the correctness reward to forecast kernel speedups.

## Loading

```python
from peft import AutoPeftModelForCausalLM
model = AutoPeftModelForCausalLM.from_pretrained("codezakh/gpu-forecasters-gpt-oss-20b-correctness")
```

## Training data

[`codezakh/gpu-forecasters-rl-training-pool`](https://huggingface.co/datasets/codezakh/gpu-forecasters-rl-training-pool).

## Reproducing

See `runbook/02_train_surrogate.py` in the paper repo
([codezakh/gpu-surrogates](https://github.com/codezakh/gpu-surrogates)).

## Citation

```bibtex
@article{khan2026gpuforecasters,
  title={GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization},
  author={Khan, Zaid and Chen, Justin Chih-Yao and Cho, Jaemin and Stengel-Eskin, Elias and Bansal, Mohit},
  journal={arXiv preprint arXiv:2605.31464},
  year={2026}
}
```