codezakh
/

gpu-forecasters-gpt-oss-20b-correctness

Model card Files Files and versions

gpu-forecasters-gpt-oss-20b-correctness / README.md

codezakh's picture

Add model card.

abe0765 verified 4 days ago

|

history blame contribute delete

1.17 kB

	Companion artifact for [_GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization_](https://arxiv.org/abs/2605.31464). Code: [codezakh/gpu-surrogates](https://github.com/codezakh/gpu-surrogates).

	LoRA adapter for [`openai/gpt-oss-20b`](https://huggingface.co/openai/gpt-oss-20b)
	fine-tuned with the correctness reward to forecast kernel speedups.

	## Loading

	```python
	from peft import AutoPeftModelForCausalLM
	model = AutoPeftModelForCausalLM.from_pretrained("codezakh/gpu-forecasters-gpt-oss-20b-correctness")
	```

	## Training data

	[`codezakh/gpu-forecasters-rl-training-pool`](https://huggingface.co/datasets/codezakh/gpu-forecasters-rl-training-pool).

	## Reproducing

	See `runbook/02_train_surrogate.py` in the paper repo
	([codezakh/gpu-surrogates](https://github.com/codezakh/gpu-surrogates)).

	## Citation

	```bibtex
	@article{khan2026gpuforecasters,
	title={GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization},
	author={Khan, Zaid and Chen, Justin Chih-Yao and Cho, Jaemin and Stengel-Eskin, Elias and Bansal, Mohit},
	journal={arXiv preprint arXiv:2605.31464},
	year={2026}
	}
	```